Senior Architect & Lead Engineer — Server Management Platform

Posted 4 weeks ago

Worldwide

Summary

Principal / Lead Go Systems Engineer: Bare-Metal & GPU Infrastructure

Project Overview: We are building a greenfield, high-scale server management platform for a GPU server manufacturer. The platform will manage Compal BMC hardware via Redfish/IPMI, monitor next-gen NVIDIA (B200/B300) and AMD (MI300/MI350) GPUs, and deliver a white-labeled management console shipped with every server. This is an intensive, 18–20 week project requiring a senior-level systems expert who can achieve immediate technical autonomy from week one.

What You Will Own:
  • BMC adapter development: Compal/AMI MegaRAC SP-X Redfish quirks and IPMI fallback
  • Two-tier architecture: control plane + zone workers designed for 25,000-device scale
  • GPU monitoring design: NVIDIA DCGM + AMD ROCm SMI integration architecture
  • Code review: every critical PR, same-day turnaround
  • Phase 0 solo: BMC characterization report delivered in week 1
  • Client-facing: architecture decisions and technical escalations

Requirements (Must Have):
  • 5+ years Go: systems-level, not just APIs
  • Redfish / IPMI / BMC: gofish, bmclib hands-on production experience
  • Distributed systems: NATS JetStream, Temporal, PostgreSQL at scale
  • GPU monitoring: NVIDIA DCGM or AMD ROCm SMI experience
  • Kubernetes + Helm: production deployments
  • Available immediately: no exceptions

Tech Stack: Go 1.22+ · Redfish · IPMI · bmclib · gofish · DCGM · ROCm SMI · Temporal · NATS JetStream · PostgreSQL 16 · VictoriaMetrics · Kubernetes · Helm

This is the critical path role. The entire team depends on this person. Only apply if you have direct BMC protocol experience (gofish or bmclib). Generic proposals will not be reviewed.

Best of Luck

  • $2,000.00

    Fixed-price
  • Expert
    Experience Level
  • Remote Job
  • Ongoing project
    Project Type

Contract-to-hire opportunity

Skills and Expertise
Mandatory skills
Kubernetes
gRPC
Network protocols
Activity on this job
  • Proposals: Less than 5
  • Last viewed by client: 3 weeks ago
  • Interviewing: 1
  • Invites sent: 1
  • Unanswered invites: 1
About the client
Member since May 29, 2014
  • India
    Kolkata 6:15 AM
  • $450 total spent
    3 hires, 0 active
