You will get data extracted from PDFs into the data format you want

Yigit G.Status: Offline
Yigit G.

Let a pro handle the details

Buy Data Mining & Web Scraping services from Yigit, priced and ready to go.
Yigit G.Status: Offline
Yigit G.

Let a pro handle the details

Buy Data Mining & Web Scraping services from Yigit, priced and ready to go.

Project details

With 5+ years of experience on automated data extraction over PDF files, I've built SpeedyF which reduces hours/days of work to minutes. By utilizing SpeedyF, I can deliver captured clean data from thousands of PDF pages.

Please see the images on what types of data I can extract using SpeedyF.

This pricing and deliverables are intended to serve basic needs only. If you
 • have many PDF files or/and large documents
 • need ongoing PDF extraction
 • need integration of SpeedyF into your system
then let's have a chat.

Feel free to contact me if you need additional information.
Data Tool
Python

What's included $150

These options are included with the project scope.

$150
  • Delivery Time 1 day
  • Number of Pages Mined/Scraped 100
  • Number of Sources Mined/Scraped 1
  • Number of Revisions 0
Optional add-ons You can add these on the next page.
Additional Page Mined/Scraped (+ 1 Day)
+$1
Additional Source Mined/Scraped (+ 1 Day)
+$50
Additional Revision
+$20
Data normalization
+$20
Yigit G.Status: Offline

About Yigit

Yigit G.Status: Offline
Senior Python Developer | Data Engineer
Mugla, Turkey - 8:48 am local time
I'm a Python developer with 12 years of experience. I've lead teams and built many robust products. I'm a freelancer since April 2022 and recently joined Upwork.

My main focus is building pipelines that makes hidden/buried data useful, including design and development of
🔹 Data extraction systems by use of Machine Learning
🔹 Backend and database models
🔹 User interfaces to monitor or take action on the pipeline/data


NOTABLE PRODUCTS
▔▔▔▔▔▔▔▔▔▔
① Data warehouse to collect extensive data of financial vehicles through various data sources, which includes:
⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣ ⦾ Data agnostic PDF extraction engine to process thousands of pages daily
⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣ ⦾ Data agnostic internet bot manager to perform actions over web pages and collect data
⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣ ⦾ Text extraction engine to capture data from emails
⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣ ⦾ Event driven workflow engine to orchestrate the data pipeline
⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣⁣ ⦾ Backend/Frontend for user interventions on workflow and data monitoring

② Data warehouse to capture emotions of actors/actresses from TV shows or movies by use of videos and screenplay scripts

③ Patented audio content recognition (ACR) technology to engage viewers of TV shows/movies via smartphones

PROGRAMMING LANGUAGES / TECHNOLOGIES
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Python: pandas, numpy, sklearn, TensorFlow, OpenCV, sns, Flask, Django, Selenium, nltk, SpaCy, celery, Spark, AirFlow, Kafka, RabbitMQ, gunicorn

JavaScript: Node.js (express, Apollo, Axios), React (hooks, MaterialUI)

DATABASE SYSTEMS
▔▔▔▔▔▔▔▔▔▔
NoSQL: MongoDB, AWS DynamoDB
SQL: MSSQL, MySQL, SQLite
Graph: Neo4j
Logging: Graylog

CLOUD TECHNOLOGIES
▔▔▔▔▔▔▔▔▔▔▔▔
Amazon Web Services: EC2, Lambda, S3, RDS, DynamoDB, Rekognition, Transcribe, EBS, CodePipeline, SQS, SNS, IAM

VERSION TRACKING / ISSUE MANAGEMENT
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Github, BitBucket, Jira

Steps for completing your project

After purchasing the project, send requirements so Yigit can start the project.

Delivery time starts when Yigit receives requirements from you.

Yigit works on your project following the steps below.

Revisions may occur after the delivery date.

Review extraction requirements

This is a tentative step to clearly understand the data, in case anything is vague.

Review the work, release payment, and leave feedback to Yigit.