Search Freelance Talent on Upwork

Pavlo G.

AI Engineer | Data Scientist | Machine Learning | Computer Vision

Ukraine

$75/hr

100% Job Success

$500K+ earned

Available now

Offers consultations

Top Rated Plus | Top 10 Machine Learning Agency on Upwork | $500K+ Earned | 8+ Years in AI & Software Engineering I'm an AI Engineer building production-grade AI agent and RAG systems, not simple prompt wrappers. With 8+ years in software engineering and AI development and $500K+ earned on Upwork, I hold Top Rated Plus status and our team is ranked among the Top 10 Machine Learning agencies on the platform. I work with companies that need an AI system to actually run in production, handle real user traffic, and stay accurate, not a demo that breaks on the first edge case. As an AI Engineer, my core work covers retrieval-augmented generation pipelines, agentic workflows with tool calling and structured outputs, prompt engineering, and LLM integration with OpenAI, Anthropic, and Gemini APIs. This is the kind of system clients need an AI Engineer for: not a chatbot that answers FAQs, but an agent that retrieves the right information, calls the right tools, and follows through on what the user actually needs, with hallucination reduction and evaluation built in from the start. As a Machine Learning Engineer and Data Scientist, I build systems for structured and time-series data: demand forecasting, anomaly detection, biomedical signal analysis, and structural health monitoring. My data scientist workflow covers Python, scikit-learn, pandas, NumPy, and SciPy alongside deep learning frameworks including TensorFlow, PyTorch, and Keras, with experiment tracking and evaluation metrics to ensure models perform consistently in production. When a project needs predictive or classification models alongside the AI agent itself, I own that layer too as a Machine Learning Engineer. As a Computer Vision Engineer and software engineer, I build object detection, multi-object tracking, pose estimation, and image segmentation systems using OpenCV, YOLO, and deep learning architectures. Where this intersects with the AI Engineer work is in multimodal systems and computer vision agents: I work with Vision Language Models (VLMs) to build AI pipelines that understand images and video, not just text. Most AI Engineers only work with text. I bring production computer vision and deep learning experience on top of the LLM layer, which matters for any product where the AI needs to see, not just read. On the engineering side, I work as a Python developer and software engineer building backend services with FastAPI, vector databases including Pinecone and pgvector, and LangChain or custom orchestration for multi-step agent logic. When a project needs full ownership of both the AI layer and the surrounding application, I work as a Full Stack AI Developer, handling backend APIs, database design, and frontend integration so the AI system ships as a complete product, not just a backend script. I work with a specialized team that includes a computer vision PhD, deep learning researchers, and mathematical optimization specialists. This lets me scope larger systems, split orchestration, retrieval, and evaluation work across the team, and deliver a full AI Engineer and Machine Learning Engineer engagement faster than a solo contributor could, with the software engineering discipline of clean APIs, logging, and testing baked in from day one. Clients typically work with me when they need: - an AI Engineer to build a RAG pipeline, AI agent, or chatbot that actually works in production - a Machine Learning Engineer or Data Scientist to build predictive models or structured data pipelines - a Computer Vision Engineer to add visual understanding or VLM-based reasoning to an AI product - a Software Engineer who understands LLM orchestration, tool calling, vector search, and backend architecture - a Python developer who can own the full stack from model training to deployed API If you need an AI Engineer and a software engineer with the full stack from prompt design to production deployment, let's talk. Main stack: Python, OpenAI API, Anthropic Claude, Gemini, LangChain, LangGraph, LlamaIndex, FastAPI, Pinecone, pgvector, TensorFlow, PyTorch, Keras, OpenCV, YOLO, Docker, PostgreSQL, JavaScript, Git.

Associated with

Requestum

$9M+

earned

Muhammad A.

Edge AI & Computer Vision Engineer | AI Agent | LLM | RAG

Pakistan

$50/hr

100% Job Success

$600K+ earned

Available now

Hello, I’m the founder of StreamTech, with over 11,000 hours across more than 100 AI and machine learning projects since 2016. I bring deep expertise in computer vision and edge AI ranging from object detection, tracking, and OCR to pose estimation, generative image processing, and event detection in sports feeds. I also excel in building intelligent AI agents and LLM driven chatbots that leverage multi turn dialog, RAG enabled memory systems, and API orchestration. My offerings extend beyond AI models and edge deployment. I provide full mobile experience solutions, creating cross platform React Native apps complemented by thoughtful UI/UX design. Whether it's crafting intuitive interfaces, responsive layouts, or seamless animations tailored for both iOS and Android, I ensure that the user experience complements the underlying AI technology. I guide projects end to end, collecting and labeling data, architecting and training models with PyTorch and TensorFlow, and deploying solutions either in the cloud (AWS, GCP) or on edge devices like Jetson Nano, Xavier, and Orin using DeepStream SDK. My engineering stack includes Python, C++, OpenCV, MediaPipe, OpenPose, SMPL, GANs, Stable Diffusion, Docker, and Kubernetes. At StreamTech, our mission has always been to harness cutting edge tech for meaningful business impact. By blending AI innovation with elegant mobile design, I help entrepreneurs and managers accelerate product development. If you're envisioning a mobile solution powered by CV or conversational AI, or need an AI agent interface that shines on mobile, let’s connect and explore how we can craft something exceptional together. Cheers!!

Kareem M.

AI Engineer (Computer Vision, Deep Learning, Machine Learning, NLP)

Egypt

$30/hr

100% Job Success

$8K+ earned

Hello, I'm Kareem! I'm an AI Engineer specializing in Machine Learning, Deep Learning, NLP, and Computer Vision, dedicated to building scalable, data-driven solutions that drive real innovation. Whether it’s intelligent chatbots, automation, or advanced AI models, I craft solutions tailored to business needs—optimizing workflows, extracting deep insights, and enhancing efficiency with cutting-edge AI. - My Promise to you 🚀 Expert Problem Solver: From simple tasks to complex challenges, I deliver solutions that are not just functional but highly efficient and robust. 📚 Comprehensive Documentation: Receive thorough documentation covering AI models, algorithms, and architectures, ensuring you fully understand the process and outcomes. 💡 Clear Code, Clear Results: My code is clean, well-commented, and designed for maintainability, making it easy to follow and adapt. 💬 Responsive Support: Need assistance? I guarantee a same-day response, providing you with consistent support throughout the project. 🔍 Stable and Thoroughly Tested Solutions: I ensure that your solution is stable, well-tested, and handles edge cases with ease, minimizing any risk of issues. 🔄 Flexible and Agile Development: Requirement changes? No problem. I adapt swiftly, ensuring full cooperation and seamless adjustments as your project evolves. 💡 My Expertise 🔹 Core Skills ✔ Machine Learning & Deep Learning ✔ Natural Language Processing (NLP) ✔ Computer Vision & Image Processing ✔ Data Analysis & Visualization ✔ Web Scraping & Automation 🔹 Programming Languages ✔ Python | C++ 🔹 AI & ML Frameworks ✔ TensorFlow | PyTorch | Keras | Scikit-learn 🔹 Machine Learning & Deep Learning ✔ ML Models: Logistic Regression, Decision Trees, SVM, KNN, Ensemble Methods, PCA, Naive Bayes ✔ Deep Learning: CNNs, RNNs, Autoencoders, GANs, Transformers, Large Language Models (LLMs) 🔹 NLP & Chatbots ✔ Text Classification, Sentiment Analysis, Language Translation ✔ Chatbot Development (Llama 3, OpenAI) ✔ Summarization, Named Entity Recognition (NER) 🔹 Computer Vision ✔ Object Detection & Tracking | Image Classification | Image Segmentation ✔ Feature Extraction | Facial Recognition | Optical Character Recognition (OCR) 🔹 Web Scraping & Data Extraction ✔ Scrapy | BeautifulSoup | Selenium 🔹 Data Analysis & Databases ✔ Matplotlib | Seaborn | SQL Looking for an AI expert who brings expertise, professionalism, and a commitment to delivering scalable, high-impact solutions? Look no further! Contact me today to learn more about how I can help take your project to the next level.

Zeeshan J.

Computer Vision & OCR Specialist | Machine Learning | Top Rated

Pakistan

$25/hr

100% Job Success

Offers consultations

Hi, I'm Zeeshan 👋 Give me something to build and i'll build it so it works for you 24/7. Thats my promise! If you shoot me a invitation or message ill send you a personalised loom video back on how i may be able to help you!... I’m a Computer Vision & Document AI Engineer with more than half a decade of experience building production-grade image and video intelligence systems not demos, not notebooks. My core expertise is image processing and computer vision, especially where data is messy, multilingual, scanned, or operationally constrained. What I Actually Build: 1. OCR & Document Image Processing I design end-to-end OCR pipelines for real-world documents: Urdu & English OCR (printed + complex layouts) Preprocessing: binarization, skew correction, denoising, layout analysis Text detection & bounding-box post-processing Tesseract, custom deep-learning OCR, Pix2Text Searchable archives using FAISS & vector indexing 2. Computer Vision for Images & Video I build vision systems that generate usable metadata: Object detection, tracking, segmentation (YOLO-based pipelines) Video-to-metadata systems for analytics and downstream automation Image classification and visual feature extraction High-performance OpenCV + PyTorch pipelines 3. Vision-Driven AI Applications When needed, I wrap CV systems into clean, usable products: Flask / FastAPI backends for vision services OCR-powered document portals Image & video search using embeddings RAG pipelines only where they add real value I’m a strong fit if you need image processing, OCR, or computer vision systems that must actually work in production especially for scanned documents, archives, or video data. EXPERTISE: AI Automation | Computer Vision | Sports Analysis | Image Processing | OCR | Mediapipe | Landmarks detection | Object Detection | Object Classification | Doucment Automation | Deep Learning | RAG | Tensorflow | PyTorch | Sign Language Production | GANs | Diffusion Models | Vision Transformers

Denis K.

Video Surveillance CCTV Computer Vision OpenCV EdTech | Detection Dev

Mexico

$40/hr

100% Job Success

$600K+ earned

Available now

Video Surveillance, CCTV, Computer Vision, OpenCV, AI – I build CCTV, object detection, and real-time video systems that scale. 14,700+ hours and $600K+ earned on Upwork. 100% JSS, Top Rated Plus. 15 years of senior engineering + Spec-First Agentic Engineering. ⚡ 𝗙𝗿𝗲𝗲 𝗣𝗥𝗗. 𝗠𝗩𝗣 𝗶𝗻 𝟳-𝟭𝟰 𝗱𝗮𝘆𝘀. 𝗙𝗿𝗲𝗲 𝗰𝗼𝗱𝗲 𝗿𝗲𝘃𝗶𝗲𝘄 𝗼𝗿 𝗮 𝟮-𝘄𝗲𝗲𝗸 𝘁𝗿𝗶𝗮𝗹 𝗼𝗻 𝗮𝗻 𝗲𝘅𝗶𝘀𝘁𝗶𝗻𝗴 𝗰𝗼𝗱𝗲𝗯𝗮𝘀𝗲. ⚠️ Press the Invite button on my page and let's talk about your project. 𝗦𝗲𝗹𝗲𝗰𝘁𝗲𝗱 𝗢𝘂𝘁𝗰𝗼𝗺𝗲𝘀 → VALT - enterprise CCTV and video surveillance SaaS. 770 organizations, 50K+ users, 2,500 IP cameras. Grew from a $1M launch year to $9M ARR. Motion detection, HIPAA/GDPR storage, 12-year partnership. → MindBox - smart video surveillance with AI analytics. Anomaly detection, facial recognition, and forensic search across 50+ deployments with live camera-map feeds. → Tennis analytics - a computer-vision pipeline that tracks players and the ball from match video for automated stats. → ALDA - generative AI course builder. Co-designed with 70+ U.S. educators; partner schools serve 500,000+ students. +20% educator efficiency. → BrainCert - world's first WebRTC + HTML5 LMS. $3M revenue, 100K+ customers, 500M+ classroom minutes. 𝗜 𝗔𝗺 𝗠𝗼𝘀𝘁 𝗘𝘅𝗽𝗲𝗿𝗶𝗲𝗻𝗰𝗲𝗱 𝗜𝗻 → Video Surveillance and CCTV - custom CCTV SaaS with secure storage, IP-camera ingestion, motion detection, and forensic search → Computer Vision and Object Detection - OpenCV, YOLO, TensorFlow, PyTorch for anomaly detection, face recognition, background blur, and incident alerts → AI Full-Stack Apps - OCR, real-time translation, autonomous agents, and object recognition on top of video → Video Streaming and Players - RTMP, HLS, WebRTC, sub-second latency, custom players on AWS → E-Learning, EdTech, and Virtual Classrooms - LMS and course authoring, curriculum and instructional design, live WebRTC and Agora classrooms, interactive whiteboards, AI grading and assessments, generative-AI courses, SCORM/LMS integrations 𝗔𝗴𝗲𝗻𝘁𝗶𝗰 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿𝗶𝗻𝗴 & 𝗦𝗽𝗲𝗰-𝗙𝗶𝗿𝘀𝘁 𝗗𝗲𝘃𝗲𝗹𝗼𝗽𝗺𝗲𝗻𝘁 I work spec-first, not vibe-first. I write the technical spec and architecture before any code is generated. Then I drive AI agents (developer, tester, analyst) against strict guardrails to produce production-grade code. The hard parts I handle or verify myself: real-time computer vision, OpenCV detection pipelines, and high-load video backends. You get the speed of AI with the stability of a senior engineer. 𝗛𝗼𝘄 𝗪𝗲 𝗦𝘁𝗮𝗿𝘁 30-min discovery → spec-first plan with architecture and a ±10% estimate → AI dev environment with guardrails → MVP with CI/CD and a 2-week demo cadence. 𝗛𝗼𝘄 𝗬𝗼𝘂 𝗖𝗮𝗻 𝗪𝗼𝗿𝗸 𝗪𝗶𝘁𝗵 𝗠𝗲 → MVP from scratch - the fastest safe path to production with Spec-First Agentic Engineering → Team extension - I join as the lead on hard modules: computer vision, AI, video surveillance, WebRTC → Rescue mission - I fix, modernize, and stabilize existing codebases, including vibe-code cleanup 𝗧𝗲𝗰𝗵𝗻𝗼𝗹𝗼𝗴𝗶𝗲𝘀 𝗜 𝗨𝘀𝗲 𝗗𝗮𝗶𝗹𝘆 → Computer Vision and AI: OpenCV, YOLOv8, TensorFlow, PyTorch, Keras, Image Recognition, OpenAI API, Whisper, Deepgram, NLP → Video: WebRTC, Agora, FFmpeg, Wowza, Kurento, HLS/DASH, RTMP/RTSP → Backend: Node.js, NestJS, Express, Python (FastAPI), PHP/Laravel, C++ → Frontend: React, Vue, Angular, TypeScript → Infra: AWS, GCP, Docker, Kubernetes, PostgreSQL, MongoDB, Redis → Security: AES-256, HIPAA, GDPR, SOC II → EdTech: LMS/SCORM, virtual classrooms, interactive whiteboards, course authoring, quizzes and assessments, AI grading 𝗪𝗵𝘆 𝗙𝗼𝗿𝗮 𝗦𝗼𝗳𝘁 100% Upwork JSS · 914 jobs · $10M+ earned · 401K+ hours · 625+ projects since 2005 · 400+ clients across 17+ countries · Top WebRTC Developer 2022 and Top Telecom Developer 2024 · 1 of 400 devs passes our internal selection · estimates within ±10% of actuals. 𝗖𝗹𝗶𝗲𝗻𝘁 𝗙𝗲𝗲𝗱𝗯𝗮𝗰𝗸 "We've been working together for 10 years and continue. Fora Soft planned and built our entire Wowza-based video surveillance SaaS from scratch across web, iOS and Android." - Dustin, CTO @ VALT (50,000 users, 770 orgs, 2,500 IP cameras) ⚠️ Press the Invite button or message me – I'll send back an SRS, an estimate, and the fastest safe path to production.

Associated with

Fora Soft LLC

$10M+

earned

Basim A.

PhD Computer Vision Expert | Object Detection, Medical Imaging, GenAI

Australia

$100/hr

93% Job Success

$50K+ earned

Available now

Offers consultations

My research has been accepted at CVPR'25 and ICCV'25 — the two most competitive computer vision conferences in the world. I now bring that same precision to client projects. I'm a Computer Vision PhD (Griffith University, Postdoc at University of Melbourne) with 7+ years turning hard visual problems into production systems. My work spans: Object Detection & Tracking — custom YOLO, DETR, and ByteTrack pipelines for real-time performance on edge and cloud Medical Image Analysis — MRI/CT segmentation, diagnostic AI, clinical-grade accuracy using PyTorch and MONAI Generative AI & ComfyUI — Stable Diffusion fine-tuning, ControlNet pipelines, custom ComfyUI node development 3D Vision — LiDAR point cloud processing, depth estimation, multi-view stereo, NeRF reconstruction Semantic Segmentation — autonomous vehicle, remote sensing, and industrial inspection systems Past clients describe me as: "a true expert with strong knowledge", "delivers excellent work, clear and professional", "highly recommended AI consultant". I work with scale-ups and product teams who need senior-level Computer Vision expertise — not just a model, but a system that performs. Available now. Let's discuss your project.

Gabor K.

Applied AI & Data Scientist | Production ML & Decision Support

Serbia

$75/hr

100% Job Success

$200K+ earned

Offers consultations

Most companies sitting on years of operational data still can't answer their most important questions: What will fail next? Where are we losing money? What should we do differently? I build the AI and ML systems that answer those questions — reliably, in production, not just in demos. I'm a PhD-trained Senior Applied AI & Data Scientist with 10+ years of experience translating messy, fragmented, domain-specific data into working decision-support systems. My background spans industrial operations, healthcare, agriculture, retail, and oil & gas — environments where the data is imperfect, the stakes are real, and generic solutions don't work. I have delivered production-ready AI/ML systems across several domains: - Retail security: anomaly-detection logic for security gate systems in large retail environments - Agriculture: greenhouse sensor analytics and ML-based decision-support for tomato production optimization - Oil & gas: production analytics, decline-curve interpretation, geospatial analysis, and forecasting dashboards - Healthcare: claims analytics, patient population classification, risk grouping, and payment workflow automation - Travel & pricing: recommendation systems, demand analysis, and personalized pricing automation - Research: peer-reviewed AI/ML and NLP publications, university-level collaboration What I typically build: - Predictive models and forecasting systems for operational and business decisions - Anomaly detection and risk scoring pipelines - Time-series and sensor data analysis - Explainable AI workflows with interpretable outputs for non-technical stakeholders - End-to-end ML proof-of-concepts and production-ready solutions for startups and enterprise teams I work best with clients who have a real business or engineering problem and need someone who can navigate unclear requirements, imperfect data, and domain complexity — and deliver something that actually works.

Gabor K. has worked .

Lampros M.

R, Python, Remote Sensing, Deep Learning and Machine Learning Analyst

Greece

$20/hr

100% Job Success

$60K+ earned

Offers consultations

For more than a decade, I utilize the R and Python programming languages to process, visualize and extract information from data and for almost five years for Geospatial analysis. I'm the author / maintainer of R packages (I've submitted more than 10 to CRAN). You can view my Github profile at the following weblink: github.com/mlampros I work with R and Rstudio on a daily basis. I can work with machine learning algorithms based on almost all CRAN, Github or Gitlab repositories. I can utilize visualization R packages such as ggplot2, plotly, tmap, leaflet, mapview. I can create shiny applications (shiny.rstudio.com/gallery/) and report my results in .pdf, word, .html or any other available format using Rmarkdown. Moreover, I'm capable of using the hybrid 'Rcpp' and 'RcppArmadillo' R packages to improve the efficiency of R code and the Keras and Pytorch deep learning libraries for regression, classification, object detection or image segmentation (with or without pre-trained models).

Muhammad A.

Machine Learning | Computer Vision | AI Automation | LLMs | N8N | RAG

Pakistan

$40/hr

100% Job Success

$60K+ earned

Offers consultations

🏆 A dedicated professional with comprehensive expertise in Machine Learning, Computer Vision, Deep Learning, and Generative AI. I bring extensive experience in deploying advanced AI models and optimizing solutions for both desktop and embedded environments. My background includes certifications in Data Science, Deep Learning, and Machine Learning in Production, complemented by hands-on project experience. 🏆 🏆🏆 𝙀𝙓𝙋𝙀𝙍𝙏 𝙄𝙉 🏆🏆 🚀 𝐂𝐨𝐦𝐩𝐮𝐭𝐞𝐫 𝐕𝐢𝐬𝐢𝐨𝐧 🚀 ✅ Computer Vision Tasks: Skilled in Image Classification, Image Segmentation, and Object Detection. ✅ Deep Learning Frameworks: Proficient with TensorFlow, Keras, and PyTorch for developing sophisticated neural network models. ✅ Advanced Architectures: Experience in implementing Vision Transformers, Swin Transformers, and DETR. ✅ Image Processing: Expertise in numpy, OpenCV, matplotlib, and Pillow for preprocessing and manipulation. ✅ Annotation Tools: Familiar with LabelKit, Labelbox, labelme, and Colabeler for data labeling and annotation. ✅ Embedded Computer Vision: Proficient in deploying models on resource-constrained environments using TensorFlow Lite and other techniques. ✅ Explainable AI: Utilizes LIME and Grad-CAM for model interpretability. 🚀 𝐌𝐚𝐜𝐡𝐢𝐧𝐞 𝐋𝐞𝐚𝐫𝐧𝐢𝐧𝐠 🚀 ⭐ Supervised Learning: Implementing models such as Logistic Regression, SVMs, Decision Trees, and Neural Networks. ⭐ Unsupervised Learning: Expertise in clustering methods like K-means, DBSCAN, Birch, and Hierarchical clustering. ⭐ Semi-Supervised Learning: Leveraging both labeled and unlabeled data for improved model accuracy. ⭐ Classification and Regression: Building and tuning models for various prediction tasks including Linear, Polynomial, Ridge, and Lasso Regression. ⭐ Ensemble Methods: Enhancing model performance with Random Forests, Bagging, and Boosting techniques. ⭐ Feature Engineering: Advanced techniques for feature selection and dimensionality reduction. 🚀 𝐃𝐚𝐭𝐚 𝐀𝐧𝐚𝐥𝐲𝐬𝐢𝐬 🚀 📈 Statistical Analysis: Applying advanced statistical methods for deriving insights from complex datasets. 📈 Data Interpretation: Translating analytical findings into actionable business strategies. 📈 Trend Analysis: Identifying patterns and forecasting future trends. 📈 Predictive Modeling: Utilizing predictive models for risk assessment and decision-making. 📈 Multivariate Analysis: Handling and analyzing multi-dimensional data. 🚀 𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈 🚀 📜 Generative Adversarial Networks (GANs): Experience with StyleGAN, StyleGAN2, StackGAN, CycleGAN, SRGAN, and DCGAN. 📜 Generative AI Tools: Proficient with Ollama, LangChain, Llama3.1, RAG, Gemini, and GPT. 🚀 𝐄𝗺𝗯𝗲𝗱𝗱𝗲𝗱 𝗦𝘆𝘀𝘁𝗲𝗺𝘀 🚀 📦 Embedded Devices Experience: Expertise with Arduino, STM32, ESP32, FPGA, Raspberry Pi, and Nvidia Jetson. 📦 Deployment Projects: Successfully deployed various AI models on edge devices, including Jetson Orin Nano and Jetson Tx2, for tasks like ANPR, emotion detection, gesture recognition, and more. 🏆🏆𝗧𝗢𝗢𝗟𝗦 𝗔𝗡𝗗 𝗧𝗘𝗖𝗛𝗡𝗜𝗖𝗔𝗟 𝗦𝗞𝗜𝗟𝗟𝗦🏆🏆 🔍 Core Tools: Pandas, Numpy, Scikit-Learn, TensorFlow, Keras, PyTorch, XGBoost, LightGBM, CatBoost 🔍 Visualization: Matplotlib, Seaborn, Plotly, ggplot2 🔍 Data Processing: Dask, Apache Spark 🔍 Deep Learning: TensorFlow, Keras, PyTorch, Caffe, Theano 🔍 Natural Language Processing: NLTK, SpaCy, Gensim, BERT 🔍 Generative AI: GANs (StyleGAN, StyleGAN2, StackGAN, CycleGAN, SRGAN, DCGAN), Ollama, LangChain, Llama3.1, RAG, Gemini, GPT 🔍 Transformers: Vision Transformers, Swin Transformers, DETR 🔍 Statistical Modeling: Statsmodels, SciPy 🔍 Computer Vision: OpenCV, PIL (Python Imaging Library), skimage 🔍 Development Tools: Jupyter, Docker, Git, SVN 🔍 Embedded Devices: Arduino, STM32, ESP32, FPGA, Raspberry Pi, Nvidia Jetson 📞 𝗟𝗲𝘁 𝘂𝘀 𝗖𝗼𝗻𝗻𝗲𝗰𝘁: 💼 I am passionate about bringing innovative AI solutions to life and optimizing them for various platforms. Contact me to explore how we can collaborate on your next project and achieve outstanding results together!

Zhiguang Z.

Senior AI Developer|DL ML LLM CV NLP ASR TTS

China

$38/hr

100% Job Success

$6K+ earned

With nearly 10 years of hands‑on experience in deep learning and machine learning, I’m your go‑to AI expert for everything from model research to full‑stack deployment. Whether you’ve got a breakthrough idea for an AI‑driven product or need to modernize your existing systems, I’ll work with you to design, train, and productionize custom models that deliver real impact. I’ve helped startups and enterprises across healthcare, finance and manufacturing: 🚀Custom AI & ML Solutions – From medical‑image analysis and 3D point‑cloud reconstruction (NeRF, segmentation) to NLP pipelines and speech systems (ASR, TTS, multi‑role emotional voice), I translate requirements into scalable, maintainable code. 🚀Competition & Research Edge – Silver and bronze medals on Kaggle; two‑time world runner‑up in ICMI’s attention‑detection; multiple patents in medical AI and image generation; peer‑reviewed publications. 🚀Large‑Scale LLM & Agent Systems – Architected domain‑specific RAG and AI‑agent frameworks for knowledge work automation and conversational platforms. What I Offer 💡Strategic AI consulting: Roadmaps, feasibility studies, and proof‑of‑concepts 💡Custom model development: DL, ML, LLM fine‑tuning, RAG, agent design 💡3D systems: 3D reconstruction, point‑cloud segmentation, NeRF integration 💡Medical AI: Early disease detection, CBCT motion‑artifact correction, MRI/CT workflows 💡Speech & NLP: ASR, TTS, multi‑persona emotional synthesis, conversational bots 💡Technical Toolbox Python · PyTorch · TensorFlow · Keras · Transformers · MLFlow · LLMs · Embeddings · Ranking · RAG, Multi-Agent · Deep-Research · LangChain · LangGraph · LangSmith · Jupyter · Scikit-learn · LightGBM · XGBoost · CatBoost · Optuna · OpenCV · OpenPose · OpenFace · NeRF · 3D Point Cloud · 3D/4D Gaussian Splatting · MONAI · CT Analysis · ASR · TTS · Elasticsearch · Docker · Antigravity · Cursor · Claude Code ✅Augmented Development with Vibe‑Coding By mastering vibe‑coding techniques, I leverage Claude Code and Cursor to accelerate development—while they don’t replace an experienced AI engineer, their assistance helps me produce cleaner, more efficient code. Let’s turn your AI ambitions into reality. Send me a message and let’s discuss how I can help you leverage cutting‑edge AI to accelerate your business.