Pranav is a data person helping enterprise, small business, and startups solving their data problems at scale. Graduate from Indian Institute of Technology, IIT Guwhati, carries 12 years of significant experience working on data engineering and data science. Pranav had been associated with esteemed businesses including Invitation Homes, Kaiser Permanente, Zurich Financial Services, Hughes Depot Supply, Boeing, Nielsen, Adap TV, Ecolab, CNA Financial, to name very few in list. OFFERING ------------- 1. Data Ingestion - Ingesting data from variety of sources to a big data ecosystem. - Data sources can be operational systems (CRM, ERP, SCM, CMS), Databases ... (Relational, NO SQL), Streaming (Machine Logs, Social Media, Event Processing), Files (Word, Excel, CSV, PDF, Text) - Big Data ecosystem can be Apache Hadoop or Apache Spark 2. Data Scale - Moving data to a big data ecosystem - Parallel processing through Apache Spark - Storing data to NO SQL databases for pre-processing 3. Data Wrangling - Data Lake development including data pipelines - Pre-processing tasks including data parsing, data profiling, data shaping, data inferencing, data enrichment, data harmonization, data cleansing, data transformation, data combining 4. Data Context - Natural Language Processing of un-structured and multi-structured data Entity selection, Featurization, Part-of-Speech, Lemmatization, Stemming, TF- IDF, Scoring - Entity Resolution, Linked Data, Semantic Web, Third Party enrichment, Ontology, knowledge graph, Word Disambiguation 5. Data Exploration - Exploratory data analysis to summarize dataset characteristics - Mining the hidden patterns in data - Statistical analysis concluding right analytical model 6. Data Analytics - Build a machine learning model meeting requirements - Validating and Testing the data model prepared 7. Data Communication - Analyzing results via Visualization, D3.JS or Tableau - Providing actionable insights through automation SKILLS --------- 1. Framework: Apache Hadoop (MapReduce, YARN, HDFS) 2. Distributed Programming: Apache Spark - Parallel processing of large scale data, batch or streaming - In-memory computation of big data in seconds or less - Machine learning implementation - Graph data analysis - Realtime streaming - Processing in Python, Scala, and R 3. Key Map Data Mode: Apache HBase, Apache Cassandra 4. Key Value Data: Reddis 5. Graph Database: Apache Giraph, Neo4J 6. Document Data Model: MongoDB 7. Data Ingestion: Apache Flume, Apache Sqoop 8. SQL-like Processing: Apache Hive, Apache Impala 9. Message Oriented Middleware: Apache Kafka, RabbitMQ 10. Service Programming: Apache Avro, Apache Zookeer 11. Scheduling: Apache Oozie 12. Applications: Apache Nutch, Apache Tika 13. Search Engine Framework: Apache Solr, Apache Lucene 14. Data Visualization: D3.JS, Tableau, Qlikview, MicroStrategy 15. Distribution: Cloudera, HortonWorks, MapR, Pivotal SPECIALITY --------------- 1. Marketing Data Handling marketing automation, CRM data, POS data, click-through data, social media, second & third party data 2. Marketing Analytics - Customer Loyalty - Customer Churn - Customer Segmentation - Upsell/ Cross-sell - Discount Targeting - Customer Behavior 3. Marketing Application Data Management Platform, DMP for marketers, publishers, & agencies Communication ============= I possess excellent English skills and and use Skype/ TeamViewer/ GotoMeeting/ Join.me to communicate with my clients on Upwork. Availability ========= Available from 9:30 AM (IST) to 1:00 AM (IST) for my clients from different time zones. Contact ====== Please send your queries to firstname.lastname@example.org or Skype at vpranav121
Pranav Verma has added 14 portfolio pieces. Create an account to review them.
Pranav Verma has more jobs to show. Create an account to review them.