I am a Data Engineer with more than 10 years of experience in developing data pipelines, ETLs and data integration to extract data from various sources (GA4, Facebook Ads, Google Ads, APIs for various web applications, CSV, Excel, MySQL, PostgreSQL, SQL Server, Oracle, Mongo etc.) and load it to various data warehouse/ data lakes (Bigquery, Redshift, Snowflake, SQL Server, Oracle etc.) after performing necessary transformation using various data integration tools and services (Airbyte, Fivetrann, Google Cloud Function, Dataform, Dataproc, streaming services, AWS Glue ETL, SSIS, ODI etc.) in line with customer's business needs.
Moreover, I have got extensive experience in developing BI dashboards/reports using Google Looker Studio, Looker, Power BI, Tableau with actionable insight comprising of useful KPIs and metrics on top of the data (i.e. Financials, Supply Chain, CRM, Sales and Marketing(GA4, e-Commerce, Facebook Ads, Google Ad, Google Search Console etc.), HRMS and Projects) for diverse business verticals.
Skill Set:
• Designing and developing data model with best practices (i.e. dimension, metric, aggregations, level of granularity) on Google Bigquery and AWS Redshift
• Developing Data Pipelines and ETL Processes to ingest data from various data sources into data warehouse using Google Cloud Functions, Google Data Stream, Google Data Fusion and AWS Glue ETL
• Performing data transformation, data cleansing, data quality checks using Python and SQL
• Ensuring data consistency, integrity and accuracy by adopting the best data prep
cleansing practices (Outliers, Data Skewness, Null Values, Duplication)
• Developing dynamic and interactive BI dashboards comprising actionable isights with useful KPIs, metrics/measured using Google Looker Studio and Power BI
• Designing meaningful and user friendly visual elements on dashboard in line with modern UI/UX practices using HTML, CSS
• Implementing data security using role based access control to provision users with the access to their respective data using RLS
• Developing automated data pipelines in Python using APIs for various e-commerce/Web platforms
• Writing complex SQL queries for data blending, cleansing and transformation
Tools and Technologies:
• Data Warehousing ( Google Bigquery, AWS Redshift, Snowflake, Azure data warehouse, Tableau, Oracle)
• ETLs and Data Pipelines and Integration (Google Cloud Functions, Google Dataflow, Data Stream, Dataform, Dataproc, AWS Glue ETL, SSIS, Data Transfer Service, Google bq CLI scripts and custom procedures, Azure Data Factory, dbt, ODI etc. )
• Developing connectors using Airbyte, Fivetran for various sources and target connector
• Data Visualization and BI Tools (Google Looker Studio, Looker, Tableau, Power BI, Amazon QuickSight, OBIEE etc.)
• Databases(SQL Server, MYSQL, Postgres, Oracle, Snowflake, MariaDB, Redshift etc.)
• SQL & T-SQL, Power Query
• Python Programming for Data Pipeline, Scraping and Data Science
• Restful API Integration with various Web Applications
• Linux Shell Scripting
• Cloud Platforms (Google, AWS, Oracle, Azure)
• Salesforce Cloud (Leads to Conversion, Customer Contacts, Customer Acquisition and Retention)
• Oracle EBS (GL, AR, AP, FA, CM, Procuremnt), Oracle Hyperion (Budget Planning)