You will get an optimized deep learning model running TensorRT on NVIDIA hardware.

Name: You will get an optimized deep learning model running TensorRT on NVIDIA hardware.
Availability: InStock

Nicholas B.

Top Rated

Nicholas B.

Top Rated

Project details

I have deep, hands-on experience with model inference optimization in compute-constrained environments. I gained this hands-on experience over several years as a Senior ML Engineer. I'm experienced with custom neural network architecture modifications, structured/unstructured pruning, quantization, TensorRT build optimization, Jetson power modes, etc.

Additionally, as a former Head of AI, I think in terms of project outcomes and return on your investment. I don't lose the forest through the trees like many developers often do.

Machine Learning Tools

Python, PyTorch, TensorFlow

What's included

Service Tiers	Starter $5,920	Standard $9,870	Advanced $15,840
Delivery Time	7 days	14 days	28 days
Number of Revisions	0	3	3
Number of Scenarios	1	1	4
Model Validation/Testing	-	-	-
Model Documentation	-	-	-
Data Source Connectivity	-	-	-
Source Code	-

Optional add-ons You can add these on the next page.

Fast Delivery

+$4,935

Additional Revision

+$1,711.66

About Nicholas

Senior AI Leader | ML | CV | NLP | LLMs | Founder | Head of AI

100% Job Success

Victoria, Canada - 6:03 am local time

⭐⭐⭐⭐⭐ Top AI Professional | Proven & Reliable Industry Experience
Does your AI model fail in the real world due to overheating, power constraints, or missed performance targets? Or are you a CTO responsible for a high-stakes AI initiative stuck in the lab? I solve these exact problems.
I specialize in deep optimization for computer vision on constrained edge devices like the NVIDIA Jetson. I use a first-principles approach to re-architect models and fine-tune hardware to ensure you meet performance targets without compromising stability. If your team is facing the "Edge AI Performance Trap," I can deliver the solution.
As a Founder, Head of AI, and Senior ML Engineer, I act as a strategic partner to technology leaders to de-risk projects, establish roadmaps, and upskill teams. I offer a unique blend of strategic leadership and deep technical execution. Whether architecting scalable MLOps pipelines or diving into first-principles model optimization, I deliver solutions that are innovative, reliable, and commercially viable.
Core Pillars of Expertise
1. AI Leadership & Strategy
✔ Fractional Leadership & Roadmapping: Acting as a strategic advisor to CTOs and VPs of Engineering to de-risk high-stakes AI initiatives. Taking ownership of AI strategy, and providing actionable rescue plans to move from prototype to production.
✔ Team Mentorship: Coaching and upskilling engineering teams to build best-in-class AI systems.
✔ Sovereign AI Governance: Advising on independent AI ecosystems (e.g., "The Basket" architecture), ensuring 100% data ownership.
2. Computer Vision & Edge AI Optimization
✔ Advanced Computer Vision: Object Detection/Tracking (YOLOX), Image Classification, and custom CNNs.
✔ Edge AI Performance: Deep expertise in the NVIDIA Jetson ecosystem (AGX Orin, Xavier, Nano).
✔ Inference Acceleration: TensorRT, INT8/FP16 Post-Training Quantization (PTQ), and model pruning.
3. Sovereign LLMs & Technical Alignment (Frontier AI)
✔ Low-Resource LLM Engineering: Fine-tuning (LLaMA, Mistral, Qwen) and reasoning-based ICL (reasoning chains) for complex, low-resource languages.
✔ Sovereign Infrastructure Architect: Designing digital infrastructure purpose-built for polysynthetic languages, prioritizing data autonomy.
✔ AI Safety & Psychometrics: Independent researcher behind PsychoBenchPro, measuring persona stability and mitigating "Alignment Faking" in frontier models.
4. Production AI & MLOps Infrastructure
✔ Full-Cycle MLOps: Architecting and implementing robust CI/CD pipelines for ML.
✔ Infrastructure: AWS, Docker, and Kubernetes for scalable, containerized deployments.

Technology Stack & Specialties
Languages & Frameworks: Python, C/C++, SQL, Bash, PyTorch, TensorFlow, Keras, TensorRT, vLLM, HuggingFace.
Models & Architectures: YOLOX, Mistral, LLaMA, Qwen, Multi-modal LLMs, Custom CNNs, Reasoning Chains.
Optimization & Tooling: DVC, MLflow, Weights & Biases, Grafana, Prometheus, Loki, Polygraphy, Pydantic, Prefect, Custom Power Mode Configuration.
Cloud & Edge: NVIDIA Jetson (AGX Orin, Xavier, Nano), Microsoft Azure (Data Lake), AWS, Docker, Kubernetes, OpenCV, Pillow, Git.

Key Achievements
✔ 60-Day Concept-to-Live: Led the transition for Fireweed AI from paper concept to live system in two months.
✔ Scaled to Production: Led the AI strategy and execution to scale an Edge AI product from 2 prototypes to dozens of mission-critical deployments across four countries.
✔ 2x Performance-per-Watt: Doubled CV efficiency on Jetson, slashing power by 50% while maintaining 60fps throughput.
✔ 75% Model Size Reduction: Reduced parameter counts by 75% via receptive field analysis without sacrificing accuracy.
✔ 100x+ Training Scalability: Designed an MLOps platform that increased throughput from ~10 models per sprint to thousands.
✔ Safety Certified: Advanced credentials in AI Safety (CAIS), Technical AI Safety, and AGI Strategy (BlueDot Impact).

Why Choose Me?
✔️ Business-Focused, Not Just Technical: I don’t just build models; I build solutions that drive business value. My experience as a Head of AI means I always connect the technical execution to your strategic objectives.
✔️ Production-Ready From Day One: My work is robust and built for the real world. I deliver clean, well-documented systems that your team can confidently maintain long after our contract ends.
✔️ A Strategic Partner, Not Just a Freelancer: I excel at de-risking projects. When you hire me, you get a fractional leader who elevates your team's capabilities while delivering on critical goals.
✔️ Clear and Proactive Communication: I translate complex technical hurdles into plain English, ensuring you always have visibility and confidence in the project's direction.

Ready to move your AI project from the lab to production? Let's connect and de-risk your roadmap.

Steps for completing your project

After purchasing the project, send requirements so Nicholas can start the project.

Delivery time starts when Nicholas receives requirements from you.

Nicholas works on your project following the steps below.

Revisions may occur after the delivery date.

Gather Context, Resources and Access + Alignment on Goals and Expectations

I'll meet with your team to understand the broader context of this project, gain access to technical resources and then I'll communicate what I believe I can offer. We will discuss potential risks and how we'll manage them.

Benchmarking

I'll benchmark the default approach to understand the starting point.

Review the work, release payment, and leave feedback to Nicholas.

Select service tier

Starter$5,920

Standard$9,870

Advanced$15,840

TensorRT Support

Ad hoc support with TensorRT problems.

Delivery Time 7 days
Number of Revisions 0
Number of Scenarios 1

7 days delivery — Jun 28, 2026

Revisions may occur after this date.

Upwork Payment Protection

Fund the project upfront. Nicholas gets paid once you are satisfied with the work.

You will get an optimized deep learning model running TensorRT on NVIDIA hardware.

Let a pro handle the details

Let a pro handle the details

Project details

Machine Learning Tools

What's included

About Nicholas

Senior AI Leader | ML | CV | NLP | LLMs | Founder | Head of AI

Steps for completing your project

After purchasing the project, send requirements so Nicholas can start the project.

Nicholas works on your project following the steps below.

Gather Context, Resources and Access + Alignment on Goals and Expectations

Benchmarking

Review the work, release payment, and leave feedback to Nicholas.

Select service tier

TensorRT Support

You will get an optimized deep learning model running TensorRT on NVIDIA hardware.

Let a pro handle the details

Let a pro handle the details

Project details

Machine Learning Tools

What's included

About Nicholas

Senior AI Leader | ML | CV | NLP | LLMs | Founder | Head of AI

Steps for completing your project

After purchasing the project, send requirements so Nicholas can start the project.

Nicholas works on your project following the steps below.

Gather Context, Resources and Access + Alignment on Goals and Expectations

Benchmarking

Review the work, release payment, and leave feedback to Nicholas.

Select service tier

TensorRT Support

Optional add-ons (2)