Docker Swarm & CapRover Infrastructure Engineer (Urgent Recovery & Stabilization)

Posted 2 weeks ago

Worldwide

Summary

We are looking for an experienced DevOps Engineer with deep expertise in Docker Swarm, CapRover, Linux server administration, and networking to help recover and stabilize a production environment. Our infrastructure is hosted on Linux servers running Docker Swarm and CapRover. Following maintenance and cleanup operations, most application services have recovered successfully, but the CapRover management dashboard remains unavailable and is returning a 502 error. The goal is to restore full CapRover functionality without impacting existing production services and to identify and resolve the root cause of the issue. Current Situation - Docker Swarm cluster with multiple nodes - CapRover v1.14.1 - NGINX service is running - Application services are running and accessible - CapRover dashboard returns NGINX 502 - captain-captain service exists but tasks remain stuck in NEW state - Service updates do not progress to container creation - Manager node is healthy and active - Swarm networks and services exist, but CapRover management service is not scheduling correctly - Need investigation before making potentially destructive changes Required Skills - Docker Swarm (Expert) - CapRover (Expert) - Linux Server Administration - NGINX - Docker Networking (Overlay Networks) - Docker Service Scheduling & Troubleshooting - SSL / Let's Encrypt - Disaster Recovery & Infrastructure Stabilization Responsibilities - Diagnose why CapRover service is stuck in Docker Swarm scheduler - Restore CapRover dashboard access - Validate swarm networking and service placement - Investigate overlay network issues - Review Docker service definitions and swarm state - Ensure no production applications are impacted - Document findings and remediation steps - Recommend long-term improvements and backup/recovery procedures Deliverables - Fully functioning CapRover dashboard - Root cause analysis - Infrastructure health assessment - Written documentation of fixes performed - Recommendations to prevent recurrence Environment - Linux - Docker Swarm - CapRover - NGINX - Production workloads currently running

  • $75.00

    Fixed-price
  • Intermediate
    Experience Level
  • Remote Job
  • One-time project
    Project Type
Skills and Expertise
Mandatory skills
Docker
DevOps
Linux System Administration
Activity on this job
  • Proposals:5 to 10
  • Last viewed by client:last week
  • Hires:
    1
  • Interviewing:
    0
  • Invites sent:
    0
  • Unanswered invites:
    0
About the client
Member since Jan 17, 2023
  • NGA
    Lagos8:39 PM
  • $1.2K total spent
    13 hires, 1 active
  • Individual client

Explore similar jobs on Upwork

Chef and Helpers for Biryani and GraviesFixed-price‐ Posted 3 weeks ago
Cooking
Computer Network
Cisco Router
Embedded System
Cisco Certified Network Associate

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo