You will get a Lakehouse for analytics and machine learning


Project details
A data platform (Lakehouse) built using only open source solutions:
Trino, MinIO, Hive, Apache Iceberg, Apache Airflow, Apache NiFi, Metabase, DataHub, dbt, PySpark.
The solutions used for the data lakehouse can change.
What's included
| Service Tiers | Starter $1,000 | Standard $1,001 | Advanced $1,002 |
|---|---|---|---|
| Delivery Time | 45 days | 45 days | 43 days |
| Number of Revisions | 1 | 1 | 1 |
| Model Validation/Testing | | | |
| Model Documentation | | | |
| Data Source Connectivity | | | |
| Source Code | | | |
Optional add-ons
You can add these on the next page.
Additional Revision
+$100
About Gustavo
Machine Learning Engineer | Data Scientist | Data Engineer
Belo Horizonte, Brazil
I cover the full workflow, from data extraction, exploratory data analysis, and preparation through modeling, model deployment, and monitoring, using only open source solutions.
I am also able to develop the whole data infrastructure for data analytics and BI, using open source solutions.
I am also familiar with cloud solutions such as AWS, Azure, GCP, and Databricks.
Steps for completing your project
After purchasing the project, send requirements so Gustavo can start the project.
Delivery time starts when Gustavo receives requirements from you.
Gustavo works on your project following the steps below.
Revisions may occur after the delivery date.
Set up all the infrastructure
Using Docker, set up all the project infrastructure.
Build simple pipeline
To verify that everything is running smoothly and to test the system, I will build one simple pipeline.