You will get Build an Image Captioning AI with ResNet50 and Transformers

Project details
I will build a high-accuracy Image Captioning model tailored to your specific needs. unlike basic off-the-shelf solutions, I utilize a state-of-the-art architecture combining ResNet50 for powerful visual feature extraction and a custom Transformer Decoder for generating natural, human-like text.
Whether you need automated descriptions for accessibility, SEO content, or organizing large photo archives, I deliver a clean, documented, and deployment-ready solution. I can also fine-tune the model on your proprietary data to ensure it understands your specific domain context.
Whether you need automated descriptions for accessibility, SEO content, or organizing large photo archives, I deliver a clean, documented, and deployment-ready solution. I can also fine-tune the model on your proprietary data to ensure it understands your specific domain context.
Machine Learning Tools
Azure Machine Learning, ChatGPT, Databricks MLflow, GitHub Copilot, Google Sheets, GPT-3, Keras, MLflow, NLTK, NumPy, OpenCV, pandas, Python, Python Scikit-Learn, PyTorch, scikit-learn, SciPy, Scrapy, Sonnet, SQL, TensorFlow, Word2vec, XGBoostWhat's included
| Service Tiers |
Starter
$40
|
Standard
$200
|
Advanced
$500
|
|---|---|---|---|
| Delivery Time | 3 days | 7 days | 14 days |
Number of Revisions | 1 | 2 | 3 |
Number of Model Variations | 1 | 1 | 2 |
Number of Scenarios | 1 | 2 | 3 |
Number of Graphs/Charts | 0 | 2 | 4 |
Model Validation/Testing | - | ||
Model Documentation | - | - | |
Data Source Connectivity | - | ||
Source Code |
Optional add-ons
You can add these on the next page.
Fast Delivery
+$20 - $100
Additional Revision
+$25
Model Documentation
(+ 1 Day)
+$50Frequently asked questions
About Boules
Machine Learning Engineer
Cairo, Egypt - 4:30 pm local time
Steps for completing your project
After purchasing the project, send requirements so Boules can start the project.
Delivery time starts when Boules receives requirements from you.
Boules works on your project following the steps below.
Revisions may occur after the delivery date.
Data Analysis & Preprocessing
I will review your provided images and captions, clean the dataset, and set up the necessary data pipelines for training.
Model Training & Optimization
I will configure the ResNet50-Transformer architecture and train (or fine-tune) the model on your data to ensure high accuracy.



