Docker Swarm & CapRover Infrastructure Engineer (Urgent Recovery & Stabilization)
Worldwide
We are looking for an experienced DevOps Engineer with deep expertise in Docker Swarm, CapRover, Linux server administration, and networking to help recover and stabilize a production environment. Our infrastructure is hosted on Linux servers running Docker Swarm and CapRover. Following maintenance and cleanup operations, most application services have recovered successfully, but the CapRover management dashboard remains unavailable and is returning a 502 error. The goal is to restore full CapRover functionality without impacting existing production services and to identify and resolve the root cause of the issue. Current Situation - Docker Swarm cluster with multiple nodes - CapRover v1.14.1 - NGINX service is running - Application services are running and accessible - CapRover dashboard returns NGINX 502 - captain-captain service exists but tasks remain stuck in NEW state - Service updates do not progress to container creation - Manager node is healthy and active - Swarm networks and services exist, but CapRover management service is not scheduling correctly - Need investigation before making potentially destructive changes Required Skills - Docker Swarm (Expert) - CapRover (Expert) - Linux Server Administration - NGINX - Docker Networking (Overlay Networks) - Docker Service Scheduling & Troubleshooting - SSL / Let's Encrypt - Disaster Recovery & Infrastructure Stabilization Responsibilities - Diagnose why CapRover service is stuck in Docker Swarm scheduler - Restore CapRover dashboard access - Validate swarm networking and service placement - Investigate overlay network issues - Review Docker service definitions and swarm state - Ensure no production applications are impacted - Document findings and remediation steps - Recommend long-term improvements and backup/recovery procedures Deliverables - Fully functioning CapRover dashboard - Root cause analysis - Infrastructure health assessment - Written documentation of fixes performed - Recommendations to prevent recurrence Environment - Linux - Docker Swarm - CapRover - NGINX - Production workloads currently running
$75.00
Fixed-price- IntermediateExperience Level
- Remote Job
- One-time projectProject Type
Skills and Expertise
Activity on this job
- Proposals:5 to 10
- Last viewed by client:last week
- Hires:1
- Interviewing:0
- Invites sent:0
- Unanswered invites:0
About the client
- NGALagos8:39 PM
- $1.2K total spent13 hires, 1 active
- Individual client
Explore similar jobs on Upwork
How it works
Create your free profileHighlight your skills and experience, show your portfolio, and set your ideal pay rate.
Work the way you wantApply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
Get paid securelyFrom contract to payment, we help you work safely and get paid securely.
About Upwork
- 4.9/5(Average rating of clients by professionals)
- G2 2021#1 freelance platform
- 49,000+Signed contract every week
- $2.3BFreelancers earned on Upwork in 2020
Find the best freelance jobs
Growing your career is as easy as creating a free profile and finding work like this that fits your skills.
Trusted by