AI Engineer — RAG & Semantic Search for Team Chat Platform (Python, pgvector, Embeddings)

Posted 2 weeks ago

Worldwide

Summary

We are building TeamChat, a workspace-based team collaboration platform (similar to Slack). This role owns the RAG and semantic search layer: making every message and file in a workspace searchable and usable as grounded context for AI features. We have a detailed scope document ready to share with shortlisted candidates. This is one of two AI roles we are hiring; strong performance leads to ongoing, long-term collaboration. CORE RESPONSIBILITIES & SCOPE OF WORK 1. Embedding Pipeline: Incremental indexing of messages and uploaded files (chunking, dedup, token-aware splitting, metadata preservation), with re-indexing and deletion propagation when sources change. 2. Vector Store & Retrieval: pgvector or Pinecone; hybrid retrieval (BM25 + vector + recency boost); relevance evaluation. Workspace/channel-level permission filtering so users never retrieve content they cannot access. 3. Semantic Search Feature: Natural-language search over workspace history with filters (from:, in:, date ranges), source citations, and latency budget suitable for interactive use. 4. Quality & Cost: Offline evaluation set for retrieval quality, embedding cost tracking and optimization, retrieval logging. 5. Delivery: Python service with documented internal APIs the messaging backend and AI feature team can call; tests + eval harness included. REQUIRED TECH STACK - Python 3.11+, FastAPI - Embeddings + vector DB: pgvector or Pinecone - Hybrid search (BM25 + vector), rerankers - PostgreSQL, Redis, Celery or equivalent workers PROJECT DETAILS - Engagement: Hourly, $15–$25/hr depending on experience. ~30 hrs/week, initial 3 months, ongoing long-term for the right person. - Process: Daily async standup (English, text), code review via GitHub PRs, 2-week sprints. At least 3–4 hours of overlap with JST (UTC+9). - IP & Code: All code delivered in our GitHub org from day one; full source ownership by us. - Language: English required. Urdu-speaking developers welcome. WHO SHOULD APPLY Please do NOT apply if your experience is limited to basic chatbot demos, simple OpenAI API wrappers, or tutorial-level LangChain projects. We will ask about production metrics (cost, latency, retrieval quality). QUESTIONS TO ANSWER IN YOUR PROPOSAL 1. Describe a RAG system you shipped to production: corpus size, retrieval architecture, and how you measured retrieval quality. 2. How would you design RAG over chat messages where retrieval must respect per-channel permissions? 3. What was your monthly embedding + inference cost in a past project, and how did you reduce it? 4. GitHub/portfolio links, timezone, weekly availability, proposed rate. 5. Start your proposal with the word TEAMCHAT.

  • More than 30 hrs/week
    Hourly
  • 3-6 months
    Duration
  • Intermediate
    Experience Level
  • $15.00

    -

    $25.00

    Hourly
  • Remote Job
  • Complex project
    Project Type
Skills and Expertise
Mandatory skills
Python
Retrieval Augmented Generation
Activity on this job
  • Proposals:20 to 50
  • Last viewed by client:last week
  • Interviewing:
    2
  • Invites sent:
    0
  • Unanswered invites:
    0
About the client
Member since Jun 2, 2026
  • Japan
    渋谷区7:37 AM

Explore similar jobs on Upwork

Build Agentic AI EngineHourly‐ Posted 9 months ago
Artificial Intelligence
turn 2D pose detection to 3dFixed-price‐ Posted 4 weeks ago
Computer Vision
3D Modeling
OpenCV

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo