Eugene A.
Manila, Philippines
Video Streaming | WebRTC | GStreamer | VoIP | FFmpeg | AI | OpenCV
$45.00/hr
Hi there!
With strong expertise in embedded firmware programming and backend web API technologies, I also have broad experience across other software domains, including desktop application development, video streaming, and OpenCV image processing. I am proficient in multiple programming languages, with extensive hands-on experience in Golang, C#, C/C++, and JavaScript. My background also covers OpenGL/DirectX 3D rendering and multi-channel video/audio streaming using GStreamer.
I am committed to continually advancing my technical skills and take pride in delivering exceptional service and solutions.
💠Areas of Expertise:
🗸Backend Development: Proficient in building robust and scalable backend systems.
🗸Firmware and Embedded Software Development: Skilled in developing reliable and efficient embedded systems.
🗸Desktop Application Development: Extensive experience in developing high-quality desktop applications.
🗸Streaming Technologies: Expertise in multi-channel video and audio streaming using GStreamer.
🗸Image Processing: Strong background in OpenCV for advanced image processing tasks.
🗸3D Rendering: Proficient in OpenGL and DirectX for high-performance 3D rendering.
💠Professional Attributes:
🗸Language Agnostic: Versatile in adopting and working with various programming languages.
🗸Continuous Learning: Dedicated to mastering new skills and technologies.
🗸Quality Service: Committed to providing excellent service and delivering high-quality solutions.
💠Programming Languages and Frameworks:
🗸Golang
🗸C#, ASP.NET, ASP.NET MVC, ASP.NET Core, Blazor
🗸JavaScript, Angular, React.js
🗸C/C++, Win32 APIs, MFC
🗸Qt, QML
Computer vision engineer: Neural Networks, OpenCV, CUDA, Git, Linux, Qt, Boost, OpenGL, PCL, SLAM, C++, Python, Pillow, NumPy. Strong math background.
✔Projects:
→ Hebrew Document Checking
→ Hugging Face LLM model merging
→ E-commerce website recommendation system
→ Face Liveness Detection
→ Wav2Lip
→ Traffic light detection
→ German Invoice Data Extraction
→ Skin Cancer Detection
→ Social Distancing Detector
→ Number Plate Detection
→ Image Reconstruction
→ Image Enhancer
→ Human Fall Detection
→ Urdu Automated Speech Recognition (ASR)
→ Handwritten Descriptive Answer Evaluation
→ Handwritten MCQs Detection
✔Machine learning engineer:
• YOLO, OpenCV, CUDA, Darknet, SegNet, TensorFlow, PyTorch
Machine learning research projects in the following domains:
- person segmentation (MODNet, RVM, TDNet, UCTransNet, XMem etc.)
- image inpainting (PEN-Net, DeepFillv2, Shift-Net, ViNet etc.)
- image upscaling (RDN, RRDN, Stable Diffusion, ISR etc.)
- image relighting (Total Relighting, DPR, RelightNet etc.)
- road segmentation for unmanned vehicles (ENet, Caffe, OpenCV, C++, Linux)
- car tracking (YOLOv3, OpenCV, C++, Linux)
- wagon number identification (YOLOv4, Python)
- implementation of real-time 360°/perspective camera transformation on CUDA (C++, CUDA, OpenCV, Linux, Jetson Nano)
- distance calculation to a point in a 2D camera frame (C++, OpenCV, Linux)
- automated grading system for handwritten answer sheets (computer vision part; OpenCV, Java for Android, Swift for iOS)
→ EDA and Pre-Processing of Dataset (Structured/Unstructured)
→ Transfer Learning of state-of-the-art models
→ Model Fine Tuning
→ Machine Learning (SVM, KNN, Regressions, Decision Trees, Random Forest, Ensemble, Time Series)
→ Deep Learning (ANN, CNN, YOLO, LSTMs, RNN)
→ OCRs (EasyOCR, PaddleOCR, GoogleOCR, Tesseract)
→ Generative image models (Inpainting, Diffusion models, DALL-E, StyleGAN)
→ Text-to-speech and speech-to-text models in many languages.
→ LLMs (LangChain, Llama, BERT, OpenAI API integration, Prompt Engineering, GPT fine-tuning, Gemini, Mistral, RAG)
→ NLP (spaCy, NLTK, NER, Word2Vec, TF-IDF)
Thank you for your time. 🙏
Skills
- FFmpeg
- WebRTC
- VoIP Software
- GStreamer
- VoIP
- Python
- C
- Node.js
- YOLO
- Broadcast Engineering