Tejas B.

Big Data Engineer

I'm a highly accurate Data Engineer adept at collecting, analysing and interpreting large datasets, and at developing and performing data management tasks. Good scripting skills in Python and shell. Good interpersonal and communication skills, both in team-based environments and in individual contributor roles.

PERSONAL PROJECTS

Twitter Data Pipeline
In this project, I extract a specific user's data using the Twitter API (Tweepy), transform it with Python, and deploy the code on Airflow/EC2. After transformation, the final result is saved to Amazon S3.

Real-Time Spark Streaming
In this project, I use a free dataset as the source and Kafka to ingest the data into a Spark Streaming application. Once the data is in the streaming application, it is split into two categories: raw data is stored in the NoSQL database Cassandra, and processed data is stored in MySQL. Finally, dashboards are built with the Superset dashboarding application.

Python ETL Project
In this project, I extracted data from different file formats, collected data through APIs and web scraping, and transformed the collected data into a ready-to-load format.


  • SQL
  • Apache Airflow
  • ETL Pipeline
  • Microsoft SQL Server Programming
  • Linux
  • Hive
  • Database Management System
  • AWS Glue
  • AWS CodePipeline