The Data Experiments
Spark | Python | Scala | AWS | Big Data | Kafka | Databases | NoSQL | SQL
Overview
We are a multidisciplinary and an award-winning Data Engineering team and developers with a focus on business solutions and web applications with experience in 70+ Data Projects; The founder is a Kaggle Grandmaster that is an expert in Data Engineering, visualisations and is used to developing Data Platforms/Products, visualisations and Dashboards as end-to-end engineers. Python/Scala Programming, Linux Admin, Data Wrangling, Data Cleansing & Data Extraction services utilizing Python 3 or Python 2 Programming or Scala/Spark on Linux or Windows. We slice, dice, extract, transform, sort, calculate, cleanse, collect, organize, migrate, and otherwise handle data management for clients. Languages: - Scala - Java - Python Cloud Solutions: - AWS - GCP Tools: - Spark - Elastic search - Mysql - MongoDB - Kafka - Airflow - Pandas - Numpy - Delta lake - Hudi - Presto/Trino - Apache Flink - Apache pinot - Neo4j - DynamoDB - K8s Services Provided: - Big data processing using Spark Scala - Building large Scale ETL - Could Management - Distributed platform development - Machine learning - Python Programming - Algorithm Development - Data Conversion (Excel to CSV, PDF to Excel, CSV to Excel, Audio) - Data Mining - Data extraction - ETL Data Transformation - Data Cleansing - OCR (Optical Character Recognition w/ Tesseract) - Linux Server Administration - Anaconda Python / Conda / Miniconda Administration - Linux Containers - Website & Data Migrations As long-time data engineers, our technical experience includes the gamut of skills required to get an ETL up and running. From server design & construction, datacenter selection, server colocation, web server software setup/configuration (Apache, NGINX), database (MySQL), server control panels, server migrations & etc