Eugene A.
ManilaPhilippines

Video Streaming | WebRTC | GStreamer | VOIP | FFMPEG | AI | OpenCV

Hi there! With robust expertise in firmware embedded programming and backend web API technologies, I also possess comprehensive experience across various software domains, including desktop application development, video streaming, and OpenCV image processing. I am proficient in multiple programming languages, with extensive hands-on experience in Golang, C#, C/C++, and JavaScript. My technical proficiency is further highlighted by my strong background in OpenCV image processing, OpenGL/DirectX 3D rendering, and multi-channel video/audio streaming using GStreamer. I am committed to continually advancing my technical skills and take pride in delivering exceptional service and solutions. 💠Areas of Expertise: 🗸Backend Development: Proficient in building robust and scalable backend systems. 🗸Firmware and Embedded Software Development: Skilled in developing reliable and efficient embedded systems. 🗸Desktop Application Development: Extensive experience in developing high-quality desktop applications. 🗸Streaming Technologies: Expertise in multi-channel video and audio streaming using GStreamer. 🗸Image Processing: Strong background in OpenCV for advanced image processing tasks. 🗸3D Rendering: Proficient in OpenGL and DirectX for high-performance 3D rendering. 💠Professional Attributes: 🗸Language Agnostic: Versatile in adopting and working with various programming languages. 🗸Continuous Learning: Dedicated to mastering new skills and technologies. 🗸Quality Service: Committed to providing excellent service and delivering high-quality solutions. 💠Programming Languages and Frameworks: 🗸Golang 🗸C#, ASP.NET, ASP.NET MVC, ASP.NET Core, Blazor 🗸JavaScript, Angular, React.js 🗸C/C++, Win32 APIs, MFC 🗸Qt, QML Computer vision engineer: Neural Networks, OpenCV, CUDA, Git, Linux, Qt, Boost, OpenGl, PCL, SLAM Strong math background, C++, Python, Pillow, Numpy ✔Projects: → Hebrew Document Checking → Hugging face LLM models merging → E-commerce website Recommendation system → Face Liveness Detection → Wav2Lip → Traffic light detection → German Invoice Data Extraction → Skin Cancer Detection → Social Distancing Detector → Number Plate Detection → Image Reconstruction → Image Enhancer → Human Fall Detection → Urdu Automated Speech Recognition (ASR) → Handwritten Descriptive Answer Evaluation → Handwritten MCQs Detection ✔Machine learning engineer; • YOLO, OpenCV, CUDA, Darknet, SegNet, TensorFow, Pytorch, Machine learning research projects in the following domains: - person segmentation (ModNet, RVM, TDNet, UCTransNet, XMem etc.) - image inpainting (Pen-Net, Deepfillv2, Shift-Net, ViNet etc.) - image upscale (RDN, RRDN, Stable Diffusion, ISR etc.) - image relighting (Total Relighting, DPR, RelightNet etc.) - road segmentation for unmanned vehicles (ENet, Caffe, OpenCV, C++, Linux) - car tracking (Yolo v3, OpenCV, C++, Linux) - wagon number identification (Yolo v4, Python) - implementation of real time 360°/perspective camera transformation on Cuda (C++, Cuda, OpenCV, Linux, Jetson Nano) - distance calculation to point on 2D camera frame (C++, OpenCV, Linux) - automate grading system for handwritten answer sheets (computer vision part, OpenCV, Java - Android, IOS - Swift) → EDA and Pre-Processing of Dataset (Structured/Unstructured) → Transfer Learning of state-of-the-art models → Model Fine Tuning → Machine Learning (SVM, KNN, Regressions, Decision Trees, Random Forest, Ensemble, Time Series) → Deep Learning (ANN, CNN, YOLO, LSTMs, RNN) → OCRs (EasyOCR, PaddleOCR, GoogleOCR, Tesseract) → Image Generative models (Inpainting, Diffusion models, Dall-E, StyleGAN) → Text-to-speech and speech-to-text models in many languages. → LLM models(Langchain, LLama, BERT, OpenAI API integration, Prompt Engineering, GPT fine-tuning, Gemini, Mistral, RAG) → NLP (Spacy, NLTK, NER, word2Vec, TF/IDF) Thank you for your time. 🙏
Skills

Skills

  • FFmpeg
  • WebRTC
  • VoIP Software
  • GStreamer
  • VoIP
  • Python
  • C
  • Node.js
  • YOLO
  • Broadcast Engineering