Hire the Best AWS Operations Engineers
Addis Ababa, Ethiopia
I build infrastructure that stays up when it matters. Government platforms for 2M+ citizens, banking systems for 1M+ users, and AI rendering stacks on GPU Kubernetes clusters. Shipped on time, documented for the team that inherits it.

5+ years owning the full stack: physical hardware, hypervisors, Kubernetes, service mesh, cloud networking, CI/CD, observability, and GPU AI workloads. Across government, fintech, SaaS, and deep-tech startups.

Government Scale: OpenG2P Ethiopia, 3 Ministries
Leading platform engineering for Ethiopia's national OpenG2P welfare system across ATI, MoWSA, and EDRMC. Serving 2M+ beneficiaries on RKE2 with Rancher, Istio mTLS, Keycloak SSO, WireGuard site-to-site VPN, XCP-ng hypervisor with Hardware RAID 1+0, and a full Prometheus + Grafana + Alertmanager + Loki observability stack. Currently executing a large-scale ODK to OpenG2P data migration with schema mapping, validation pipelines, audit trails, and phased rollback.

AWS Serverless Architecture: WeTruck
Architect and operator of WeTruck's serverless logistics platform. CloudFront to ALB to WAF (deny-by-default with Telebirr payment gateway IP whitelisting) to Lambda (FastAPI via Mangum). Fully Terraformed infrastructure. GitHub Actions with OIDC federation (zero long-lived credentials), RDS PostgreSQL, DynamoDB for JWT token store, SQS with dead-letter queue, Secrets Manager, AWS Amplify for three Next.js frontends (back-office, transporter, shipper), and Route 53 private/public DNS. I own production incidents end-to-end and authored the full deployment runbook and troubleshooting playbook.

AWS in daily use: Lambda, EC2, EKS, CloudFront, ALB, WAF, RDS, DynamoDB, SQS, Secrets Manager, Amplify, ECR, Route 53, IAM OIDC, VPC (private subnets, NAT Gateway, VPC endpoints), CloudWatch, ACM.

MPLS and AI Rendering Infrastructure: MetaPlux (Tech Lead)
Tech Lead on a distributed GPU-aware AI rendering platform.
- MPLS networks for deterministic, low-latency routing across distributed rendering nodes with label-switched paths and traffic engineering for QoS-sensitive GPU workload traffic
- BGP and OSPF routing across hybrid environments connecting on-premises GPU clusters to cloud egress points
- WireGuard VPN tunnels for secure, auditable connectivity
- Multi-cloud strategy across AWS and Azure for rendering workload distribution
- Platform architecture for GPU-aware Kubernetes scheduling and job orchestration
This role sits at the intersection of deep networking and modern cloud infrastructure, the combination most DevOps engineers don't have.

AI and GPU Infrastructure: Exponent.ch (Switzerland)
Provisioned and managed NVIDIA GPU-enabled RKE2 clusters running AI and automation workloads: n8n, Langfuse LLM observability, LibreChat, and ClickHouse analytics. Standardized Helm deployments across 5+ environments using shared-stacks architecture. Integrated Teleport for zero-trust cluster access, automated secrets via Lade and 1Password. Prometheus + Grafana + Mimir observability with 99.9% metric coverage, 60% MTTR reduction, and 3 critical outages prevented through proactive alerting.

Banking Infrastructure: 6 Ethiopian Banks, 1M+ Users
Delivered high-availability mobile banking and USSD platforms for Siinqee Bank, Hijra Bank, Wegagen Bank, and others with 99.95% uptime across 1M+ users. Designed MPLS network architecture for multi-branch WAN connectivity with QoS for transaction data. Built Jenkins CI/CD for Maven Java apps (4 hours to 15 minutes). ELK processing 10TB+ daily logs. Proxmox and AWS hybrid cloud with ~35% cost reduction.

Full Technical Stack
- Container Orchestration: Kubernetes (RKE2, EKS, K3s), Docker, Helm, Rancher, OpenShift
- Infrastructure as Code: Terraform, Terragrunt, Ansible, OpenTofu
- CI/CD: GitHub Actions, GitLab CI, Jenkins
- Cloud: AWS (full suite), Azure, OpenStack
- Networking: MPLS, BGP, OSPF, WireGuard, HAProxy, NGINX, VLAN, VPC design
- Service Mesh: Istio (mTLS, traffic policies, ingress)
- Observability: Prometheus, Grafana, Mimir, Alertmanager, Loki, OpenTelemetry, ELK, Zabbix, SigNoz
- Security and Identity: Keycloak, HashiCorp Vault, Teleport, IAM OIDC
- Virtualization: Proxmox, XCP-ng, VMware
- AI and Automation: NVIDIA GPU Kubernetes, YOLO, ONNX, TensorRT, n8n, Langfuse, LibreChat, ClickHouse
- Databases: PostgreSQL, MySQL, MongoDB, Redis, Elasticsearch, DynamoDB
- Backend: Python (FastAPI), Bash, Node.js

How I work
I overcommunicate early, give realistic timelines, and ship verified, not "it should work." I treat testing as the real deliverable, not a checkbox. If something is out of scope or a bad idea, I'll tell you before you spend money on it.

Message me if you need:
- AWS or Kubernetes infrastructure built or fixed
- CI/CD pipelines developers actually trust
- GPU and AI infrastructure with inference pipelines and observability
- Migration from cloud to cloud, or on-prem to production cloud
- MPLS, BGP, or complex network architecture across hybrid environments
- Observability that catches problems before users do
- Amazon Web Services
- Docker
- Kubernetes
- NGINX
- Automation
- CI/CD
- GitLab
- Linux System Administration
- AWS Amplify
- AWS Lambda
- Terraform
- OpenStack
- Rancher
- n8n
- Prometheus
Los Angeles, California
📣 FREE technical consultation (details below)

🐻 Proud UC Berkeley graduate (Applied Mathematics & Economics) and an entrepreneurial, reliable full-stack software developer with broad expertise across modern technologies (listed below).

💪💪💪 With experience as:
• Full-Stack Team Lead at Camping World, a publicly traded company
• Lead Technical Teaching Assistant at Hack Reactor, a leading software engineering bootcamp
• Founder of Atini Studio, a high-performance software development agency

With years of experience working with clients and leading engineering teams, I prioritize accountability, clear communication, and efficient execution.

🌈🌈🌈 I’m the founder of Atini Studio, a vibrant software development agency and a team I like to call software doctors: highly skilled engineers and creative UI/UX designers who diagnose, fix, and elevate digital products.

⌨️⌨️⌨️ What I help clients with:
📌 MVP development for startups
📌 Full-stack web applications
📌 Cross-platform mobile apps
📌 AI-powered features & integrations
📌 Clean implementation from Figma to production
📌 API integrations (Stripe, OpenAI, Google, Zoom, etc.)
📌 Performance optimization
📌 Technical architecture planning
📌 Product roadmap & scaling strategy

We also leverage AI internally to accelerate development workflows and improve efficiency, so you get faster delivery without sacrificing quality.

🧩 Complimentary Technical Consultation
Not sure how to move forward with your product? I can help you clarify:
• The right tech stack
• MVP scope
• Timeline and budget expectations
• System architecture decisions
• Whether your current product is built correctly
I offer a complimentary strategy session to help you create a clear and practical execution plan.

💻 Tech Stack:
• Front End: JavaScript | TypeScript | React | Next.js | React Native | Angular | Vue.js | Tailwind CSS
• Back End: Node.js | NestJS | Python | Django | PostgreSQL | MongoDB | MySQL | ORM | GraphQL
• API Integrations: ChatGPT / OpenAI | Stripe | Google | Zoom | and more
• DevOps: Amazon Web Services (AWS) | Firebase / GCP (Google Cloud Platform) | Azure | Docker
• Testing: Jest | Mocha | TDD
• Developer Tools: Git | npm | Webpack | Babel | Agile Methodology | Nginx
• Design: Adobe XD | Figma
- Node.js
- React
- AWS Amplify
- CSS 3
- RESTful API
- React Native
- Firebase
- Angular
- Amazon EC2
- JavaScript
- Database
- WordPress
- ChatGPT
- AI Development
- AWS Development
Chittagong, Bangladesh
Struggling with unreliable infrastructure, VMware licensing costs, or a stalled cloud migration? I'm a DevOps & Cloud Engineer who builds and stabilizes production environments across AWS, Azure, Proxmox, and Ceph, with a focus on uptime and automation.

Most DevOps engineers specialize in either cloud or on-prem infrastructure. I do both: I've run production Proxmox VE clusters with Ceph storage alongside AWS, Azure, and GCP deployments, and I handle VMware-to-Proxmox migrations end-to-end. It is a combination few freelancers on Upwork cover with hands-on production experience.

Here's how I typically help clients:
✔ Design and deploy cloud infrastructure on AWS, Azure, or GCP
✔ Build and troubleshoot Proxmox clusters, including VMware-to-Proxmox migrations
✔ Optimize and recover Ceph storage clusters
✔ Set up Docker and Kubernetes environments with CI/CD pipelines
✔ Implement monitoring with Zabbix and Grafana
✔ Plan and test backup and disaster recovery strategies
✔ Secure infrastructure with VPNs, firewalls, and Zero Trust practices

Top Rated Plus on Upwork with a 4.98 rating across 51 completed jobs and over 2,870 hours worked. AWS Cloud Practitioner Essentials certified. Clients include startups, MSPs, SaaS companies, and enterprise teams relying on me for production-grade infrastructure.

Send me a message with what you're working on and I'll respond with next steps within a few hours.
- Docker
- Proxmox VE
- AWS Application
- Linux System Administration
- System Administration
- Zero Trust Architecture
- VMware Administration
- VMware ESX Server
- Microsoft Azure Administration
- Windows Server
- Azure DevOps
- Terraform
- Kubernetes
- Ceph
- Cloud Architecture
- OpenStack
- OpenShift
San Antonio, Texas
Top-Rated Plus, Expert-Vetted, Upwork Top 1 Percent, Fortune 500, Federal, Inc 5000

Thanks for reading! 🚀 I build the world's leading AI and DevOps tooling, and we bring that same engineering rigor to every Upwork engagement.

I've shipped production systems for Fortune 500 enterprises, federal agencies, the U.S. military, and Inc. 5000 startups. That means the patterns we deploy for your project aren't theoretical: they're running right now in environments with real compliance requirements, real uptime SLAs, and real scale. We've designed the kind of infrastructure other teams buy as a product, and now we're making that caliber of work available to smaller teams and faster-moving projects.

Here's what you get when you hire me:

🏆 BATTLE-TESTED EXPERTISE
Terraform, CI/CD, LLM pipelines, observability, secure-by-default AWS architecture: the full modern cloud stack, built by people who've deployed it in regulated environments.

🤖 FRONTIER AI & DEVOPS TOOLING
We build the systems other agencies resell. When you hire me, you're hiring the builders, not the middlemen.

💰 ENTERPRISE-GRADE WORK, STARTUP-FRIENDLY PRICING
I'm deliberately structured to deliver senior-level results at a rate that fits real-world budgets. No big-agency markup, no junior engineers pretending to be senior.

Whether you need to cut an out-of-control AWS bill, ship a CI/CD pipeline you can actually trust, stand up an AI system that solves a real business problem, or just get a second opinion from people who've seen it all, I'd love to talk about what you're building.

Hire me and you're getting the expertise of a team trusted by the world's most demanding clients, at a rate that actually makes sense. Let's build something.
- PHP
- AWS Lambda
- Cloud Security Framework
- Laravel
- Amazon EC2
- Cloud Computing
- Load Balancing
Bahawalpur, Pakistan
Your data is scattered across APIs, databases, and third-party tools, and right now it takes your team hours to pull reports that should take seconds. I fix that.

I'm Daniyal, a Data Engineer who builds production-grade ETL/ELT pipelines that collect, transform, and deliver clean data to your dashboards automatically. My pipelines run 24/7 and scale with your business.

Recent Results:
• BigQuery warehouse ingesting 50,000+ daily records (client margins up 22%)
• Airflow ETL processing 600,000+ weekly records for a real estate platform
• Automated data pipeline generating 600+ qualified leads in 45 days ($75K in new revenue)
• Cut manual reporting from 8 hours/week to zero with scheduled orchestration

What I Build:
• ETL/ELT pipelines on BigQuery, Snowflake, and Redshift
• Apache Airflow DAGs for scheduled, monitored data orchestration
• Data warehouse architecture with dbt transformations
• Real-time and batch data ingestion from APIs, databases, and flat files
• Monitoring, alerting, and data quality checks built into every pipeline

Tech Stack:
• Warehouses: BigQuery, Snowflake, Redshift
• Orchestration: Apache Airflow, dbt, Prefect
• Processing: PySpark, Pandas, SQL, Kafka
• Cloud: AWS (S3, Glue, Lambda, Redshift), GCP (BigQuery, Dataflow, Composer)
• Infrastructure: Docker, Terraform, CI/CD

Top Rated • 100% Job Success Score • Response within 2 hours

Full documentation, clean handoff, and 30-day post-delivery support on every project. Send me your data challenge and current stack. I'll reply within 2 hours with a clear plan.
- Data Engineering
- BigQuery
- Apache Airflow
- Data Scraping
- Python
- SQL
- dbt
- Apache Kafka
- Data Extraction
- AWS Lambda
- PySpark
- API Integration
- Selenium
- Beautiful Soup
- Scrapy
- PostgreSQL
- Django
- Snowflake
- Data Visualization
- ETL
Budapest, Hungary
📌 About Us
Led by a DevOps and SRE veteran with 20+ years of industry experience, we are a powerhouse team of 100+ top-tier DevOps, SecOps, Cloud, and Site Reliability Engineers. We specialize in designing, building, and managing infrastructures that empower innovation while guaranteeing stability, security, and cost-efficiency. With over 1,000 successful projects delivered for startups, enterprises, and global brands across fintech, e-commerce, health tech, media, and AI/ML, we take end-to-end ownership, from initial architecture design to 24x7x365 L1/L2/L3 support.

Core Capabilities & Business Impact

1. Platform Engineering, DevOps & CloudOps
We build multicloud architectures and scalable systems that enable daily releases through automated "golden paths" and predictable environments.
- Infrastructure as Code (IaC) & Automation: Terraform, Pulumi, CloudFormation, Ansible, Chef, Puppet.
- Containerization & Orchestration: Kubernetes, Docker, EKS, Rancher.
- CI/CD & GitOps: GitHub Actions, GitLab CI, Jenkins, CircleCI, Argo CD, Flux.
- Web & Database Tuning: LAMP and LEMP stack setup, database tuning, and high-availability website speed optimization.

2. 24/7 Site Reliability Engineering (SRE) & IT Support
We provide true 24x7x365 L1, L2, and L3 support for applications, servers, and end-users, ensuring SLA/SLO-driven operations.
- Reliability & Incident Management: On-call rotations, escalation trees, postmortems, root cause analysis, and error budgets.
- Observability: Transparent SLI/SLO dashboards using Prometheus, Grafana, OpenTelemetry, ELK, Datadog, and New Relic.
- IT Consultancy & Workstation Support: End-user maintenance (B2B/B2C), patching, updates, backup management, and policy enforcement via NinjaOne.
- Email Deliverability: DNS, DKIM, SPF, DMARC, and MX configuration.

3. FinOps & Cloud Cost Optimization
We turn infrastructure into a strategic advantage by reducing cloud costs by 20–60%.
- Cost Control: Budgeting, forecasting, rightsizing, cost governance, and waste reduction on AWS, Azure, and GCP.
- FinOps Tooling: CloudHealth, AWS Cost Explorer, GCP Billing.

4. Security, Networking & Compliance
We ensure robust security postures and continuous compliance readiness.
- Security & Identity: Entra ID, HashiCorp Vault, SOPS, RBAC audits, WAF/Shield.
- Policy & Compliance: Policy-as-Code (OPA/Gatekeeper), CIS Benchmarks, SOC 2, and ISO 27001 readiness.
- Network Hardware & Integrations: Cisco remote networks, MikroTik remote devices, FortiGate, and QNAP configuration.
- Server Hardening: Deep optimization and security hardening for Linux, UNIX, and Windows Server systems.

5. MLOps & AI Infrastructure
We partner with ML engineers to build reliable, reproducible production workflows.
- ML Pipelines: MLflow, Kubeflow, SageMaker.
- Infrastructure: GPU-enabled infrastructure setups, data versioning, and model tracking.

Comprehensive Technology Stack
- Cloud Providers: AWS (Certified Partner), Microsoft Azure (Certified Partner), GCP, OCI.
- Operating Systems: Linux (20+ years expertise), UNIX, Windows Server.
- Containers & Orchestration: EKS, Kubernetes, Docker, Rancher.
- IaC & Configuration: Terraform, Pulumi, CloudFormation, Ansible, Chef, Puppet.
- CI/CD & GitOps: Argo CD, Flux, GitHub Actions, GitLab CI, Jenkins, CircleCI.
- Observability & Monitoring: Prometheus, Grafana, OpenTelemetry, ELK Stack, Datadog, New Relic.
- Security & Policy: OPA, Gatekeeper, Vault, SOPS, Entra ID, WAF/Shield.
- FinOps & MLOps: CloudHealth, AWS Cost Explorer, GCP Billing, MLflow, Kubeflow, SageMaker.
- Networking & IT Management: FortiGate, Cisco, MikroTik, QNAP, NinjaOne.
- Web & Email: LAMP, LEMP, DNS, DKIM, SPF, DMARC, MX.

Who We Work With
- Startups: Building scalable, secure infrastructures from day one.
- Enterprises: Driving modernization, automation, and DevOps transformations.
- Finance & SaaS: Implementing rigorous security, compliance, and FinOps practices.
- ML/AI Teams: Deploying robust frameworks for model scaling.

Ready to transform your infrastructure, optimize costs, and secure 24/7 reliability? Let’s talk.
- Amazon Web Services
- Apache Tomcat
- Ansible
- Docker
- Windows Administration
- Service Cloud Administration
- Linux System Administration
- Technical Support
- Apache Administration
- DevOps
How it works
Post a job for free
Tell us what you need. Create your own job post or generate one with AI, then filter talent matches.
Hire top talent fast
Consult, interview, and hire quickly, so you can meet the freelancers you're excited about.
Collaborate easily
Use Upwork to chat or video call, share files, and track project progress right from the app.
Payment simplified
Manage payments in one place with flexible billing options. Only pay for approved work, hourly or by milestone.
Don't just take our word for it
“Upwork provides an umbrella-level of security. I can see a talent’s work history and ratings. I can hold payments in escrow. I can communicate through Upwork Messages instead of working through my email address.”
Kim Darling
Emerald Tiger
“Upwork is the best platform to hire skilled professionals when we're not looking for a full-time employee. All the companies in our portfolio use Upwork to find talent across a wide range of fields.”
David Merry
Kinetic Investments
“Our very specific requirements can be a challenge—With Upwork, we’re able to access a bigger community to ensure the success of our projects.”
Katja Krohn
Summa Linguae
How do I hire an AWS Operations Engineer on Upwork?
You can hire an AWS Operations Engineer on Upwork in four simple steps:
- Create a job post tailored to your AWS Operations Engineer project scope. We’ll walk you through the process step by step.
- Browse top AWS Operations Engineer talent on Upwork and invite them to your project.
- Once the proposals start flowing in, create a shortlist of top AWS Operations Engineer profiles and interview.
- Hire the right AWS Operations Engineer for your project from Upwork, the world’s largest work marketplace.
At Upwork, we believe talent staffing should be easy.
How much does it cost to hire an AWS Operations Engineer?
Rates charged by AWS Operations Engineers on Upwork can vary with a number of factors including experience, location, and market conditions. See hourly rates for in-demand skills on Upwork.
Why hire an AWS Operations Engineer on Upwork?
As the world’s work marketplace, we connect highly skilled freelance AWS Operations Engineers and businesses and help them build trusted, long-term relationships so they can achieve more together. Let us help you build the dream AWS Operations Engineer team you need to succeed.
Can I hire an AWS Operations Engineer within 24 hours on Upwork?
Depending on availability and the quality of your job post, it’s entirely possible to sign up for Upwork and receive AWS Operations Engineer proposals within 24 hours of posting a job description.
Find more freelancers
Similar AWS Operations Engineer Skills
- AWS CloudFormation Developers
- AWS DevOps Engineers
- Certified AWS Cloud Network Engineers
- Certified AWS DevOps Engineers
- AWS CodePipeline Specialists
- Certified AWS Network Engineers
- AWS Lambda Developers
- AWS Developers
- AWS CloudFront Developers
- AWS Cloudwatch Developers
- AWS IoT Device Management Developers
- Certified AWS Cloud Engineers
- AWS Elastic Beanstalk Developers
- AWS CodeDeploy Developers
- AWS CodeBuild Specialists
- AWS Solution Architects
Top Countries for AWS Operations Engineers
- Network Administrators in Sri Lanka
- Network Administrators in Kazakhstan
- Network Administrators in Zimbabwe
- Network Administrators in Portugal
- Network Administrators in Montenegro
- Network Administrators in Nepal
- Network Administrators in Kenya
- Network Administrators in Mexico
- Network Administrators in the Netherlands
- Network Administrators in Pakistan
- Red Hat Administrators in Spain
- Red Hat Administrators in Romania
- Red Hat Administrators in Argentina
- Red Hat Administrators in Australia
- Network Administrators in Nigeria
- Computer Network Architects in Nepal