NLP Specialist for Bengali Text Annotation & Metadata Extraction

Posted last month

Worldwide

Summary

We are building a Retrieval-Augmented Generation (RAG) system for Bengali content and are looking for a skilled NLP practitioner to enrich raw text data with structured metadata. Given Bengali news or article content, your task will be to extract and generate high-quality annotations, including: • Emotion (e.g., concerned, neutral, optimistic) • Sentiment (positive, negative, neutral) • Topic classification (e.g., corruption, politics, health) • Named entities (key organizations, people, institutions) • Keywords (relevant terms for retrieval optimization) The output must strictly follow a predefined JSON schema and maintain consistency across large datasets. Responsibilities: • Design and/or implement a pipeline to generate structured annotations from Bengali text • Ensure linguistic and contextual accuracy in both Bengali and English • Optimize outputs for downstream retrieval systems (TF-IDF, BM25, hybrid search, etc.) • Handle edge cases such as ambiguous sentiment or mixed topics • Maintain clean, valid, and production-ready JSON outputs Requirements: • Strong proficiency in both Bengali and English (reading & writing) • Experience with NLP tasks such as NER, sentiment analysis, topic classification • Familiarity with LLM prompting, fine-tuning, or annotation pipelines • Experience working with JSON schema validation and structured outputs • Bonus: Experience with RAG systems, search optimization, or multilingual embeddings Preferred Stack (optional): • Python (spaCy, HuggingFace, sentence-transformers, etc.) • Experience with Bengali NLP tools or datasets • Understanding of vector databases (Qdrant, FAISS, etc.) Deliverables: • Annotated dataset in required JSON format • (Optional) Reusable pipeline/script for automated processing Project Type: Freelance / Contract Duration: Short-term (with potential for long-term collaboration) If you have experience working with Bengali language data and understand how structured annotations improve retrieval quality, we’d love to hear from you. This is a part of the assignment to assess the quality of work. If satisfactory, longer term contracts will follow.

  • $100.00

    Fixed-price
  • Intermediate
    Experience Level
  • Remote Job
  • Complex project
    Project Type
Skills and Expertise
Mandatory skills
Python
Natural Language Processing
Activity on this job
  • Proposals:5 to 10
  • Last viewed by client:4 weeks ago
  • Hires:
    1
  • Interviewing:
    0
  • Invites sent:
    0
  • Unanswered invites:
    0
About the client
Member since May 6, 2026
  • CAN
    Toronto7:07 AM
  • $72 total spent
    1 hire, 1 active

Explore similar jobs on Upwork

Database University AssignmentsHourly‐ Posted 9 months ago
SQL
Database
Microsoft Excel
Database Design
Database Management
SQL Server Integration Services
Excel Macros
Excel Formula
Microsoft Power BI
Microsoft Excel PowerPivot
Power Query
Data Entry
Data Cleaning
Data Analytics
Data Extraction
SAP HANA
SAP
ABAP
SAP ERP
SAP BASIS
SAP Business Warehouse
SAP BusinessOne
SAP SD
SAP FI

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo