Data Scraping Jobs

592 were found based on your criteria {{ paging.total|number:0 }} were found based on your criteria

show all
  • Hourly ({{ jobTypeController.getFacetCount("0")|number:0}})
  • Fixed Price ({{ jobTypeController.getFacetCount("1")|number:0}})
Hourly - Entry Level ($) - Est. Time: 1 to 3 months, 30+ hrs/week - Posted
We are looking for a developer to create a search tool for us that lets us search Youtube video producers by country, keywords in their details section and keywords in titles and descriptions of videos. Results will show list of producers, links to youtube pages, total videos made, total views and followers. - Instagram and Twitter search for influencers by location and keywords. Results give influencer list with their country, total followers, total posts links to each social page.
Skills: Data scraping API Development HTML HTML5
Fixed-Price - Intermediate ($$) - Est. Budget: $200 - Posted
I need someone to find a list of all (25 hours worth) Companies in the Southwestern quadrant of the map of Pennsylvania that are tenants in or owners of commercial buildings. I have attached a map for reference. Pretty much if you draw a line down and across the center of the map to make 4 quadrants, the businesses in the counties of interest would be in the southwest (bottom left) quadrant. The task is essentially to make a spreadsheet of the Company, address, phone number, CFO or director of finance, CEO, office Manager (if applicable) and any other applicable notes. This will serve as my call list so i need as much information as possible.
Skills: Data scraping Data Entry Data mining Internet research
Fixed-Price - Intermediate ($$) - Est. Budget: $40 - Posted
I have a list of 698 names and addresses that I need Phone numbers and email addresses for. I will send an excel Spread sheet with the information that I currently have. You will receive further instructions once hired. I need this done by Monday.
Skills: Data scraping Spreadsheets Web scraping
Fixed-Price - Intermediate ($$) - Est. Budget: $50 - Posted
We currently use a Daily Call tracker that lives in Google Sheets and is updated via Google Form with some of the information input manually. We are having several issues with the sheet and need someone to correct the formulas for the following: -Under the Monthly Snapshot tab, odd dates from 1899 are showing up. I'm not sure if this is because we've added people to our team, but this would need to be corrected - We changed the time zone on the sheet to be correct for our local time (US - Central), make the sheet run from Monday-Sunday each week rather than Sunday-Saturday - Lastly, there are a couple of people who left the team that were not here long enough to have a lot of submissions such as Colby Long and Kelly Talavera. Would it be possible to delete their form submissions? I'm happy to answer any questions about the project ahead of time to ensure the person we hire can do the work. Person hired will have the Google Sheet and Form shared with them. Again, this is an existing sheet that just needs to be updated. Thanks, Allison
Skills: Data scraping Web Crawler Web scraping
Fixed-Price - Entry Level ($) - Est. Budget: $100 - Posted
I need data mined from publicly available websites so that it can be formatted in an Excel spreadsheet for analysis. Output can be sent as an MS Excel file (preferred) or OpenOffice Calc. All relevant data from the websites, both current and archives, should be retrieved and stored in the file. See the following link to see what I need. https://docs.google.com/spreadsheets/d/11JyIeYihYQqN-P09O8QWzgeoM3vwLB5R0vM1M1gBgso/edit?usp=sharing Please message me if you have questions or comments. My choice of applicant will be made by mid-next week. Important factors for me will be: 1) Communication with me must be timely and courteous. 2) Accuracy and Completeness of Output must be assured. 3) Proposed Budget (a reasonable, low fixed price is preferred) 4) Proposed Timeline (the sooner the better, as long as #2 is met)
Skills: Data scraping Data mining Microsoft Excel
Fixed-Price - Entry Level ($) - Est. Budget: $200 - Posted
PROJECT: Facility information needed from Electric Wire Service Providers. Data collection, pivot tables, mapping, data referencing, investigative work through the internet, knowledge of power markets is a benefit. May require you to contact Wire Service Providers for further information. With site ID data that you obtain from Wire service providers – take the legal land description and find out what type of facility is operating there, who owns it, what do they do at that location (for example: does the process chemicals, generate power, have a data center) – find the contact information of the operations manager for that location or business. This is investigative work that requires mapping. Here is an order of operations to help complete the deliverables on the job: 1. Get Wire Service Provider List: http://www.hme.ca/connecttothegrid/Map%20showing%20Alberta's%20Electric%20Distribution%20System's%20Owners.pdf -Atco -Epcor -Fortis -Altalink 2. Download Retail Site Catalogue for Each Provider – you may have to investigate where to find the reports and get the Site ID data in CSV file but it is available Site ID – 13 digit # number identifies the meter and address Here is the link for Fortis http://www.fortisalberta.com/for-business-industry/retailers 3. With the site ID data – take the legal land description and find out what type of facility is operating there, who owns it, what do they do at that location (for example: does the process chemicals, generate power, have a data center) – find the contact information of the operations manager for that location or business. This is investigative work that requires mapping. 4.Generate an excel report that allows us to: a).filter by type of business. Limit its operation to one word. Either consumer, generator, producer, manufacturer, b). give contact information c). give operation managers name and contact information d).The location of the site e). Clarify that it is a transmission connected site.
Skills: Data scraping Data Entry Digital Mapping English
Fixed-Price - Intermediate ($$) - Est. Budget: $300 - Posted
Objectives: The objective of this project is to create a tech startup business news classifier that from the content of a news article (title and body) returns the categories of industry the news is about. Deliverables: - URL of the created MonkeyLearn classifier. - CSV file with the training data, named "dataset.csv". - Text file with the summary, named "summary.txt". Process: The process will be divided into five major steps. Please read all the following steps before starting: 1- Data Gathering Obtain training data, that is, articles about business news. The following sources are recommended: - Techcrunch - Venturebeat - Recode.net - The Next Web - Wired - Gizmodo - Cnet - The Verge The data shall be saved in a CSV file with the following format: Title Title + Content Date Author URL 2 - Data Tagging After gathering the articles, you'll have to tag the articles into the following categories: - AI/Machine Learning: News about companies in AI/Machine Learning sector - Internet of Things: News about companies in the IoT sector - FinTech: News about companies in FinTech sector - AR/VR: News about companies in AR/VR sector - ChatBots: News about companies in ChatBots sector - Robotics: News about companies in Robotics sector - Driverless Cars: News about companies in Driverless Cars sector - Bitcoin/Blockchain: News about companies in Bitcoin/Blockchain sector - Shared Economy (Uber, Airbnb, etc): News about companies in Shared Economy sector - Social Media (Facebook, Twitter, Snapchat, etc): News about companies in Social Media sector - Messaging Apps (Whatsapp, Telegram, etc): News about companies in the Messaging sector - 3D printings: News about companies in 3D printings sector - Health: News about companies in Health sector - Ad: News about companies in Ad tech Space - Ecommerce/Retail (Amazon, etc): News about companies in Ecommerce/Retail sector - Drones: News about companies in Drones Space - Mobile (iOS, Android, apps, etc): News about companies in Mobile sector - Gaming: News about companies in Gaming sector - Gadgets: News about Gadgets (like iPhone, chromecast, Snapchat Spectacles, Galaxy Note 7, Apple Watch 2, Sony MDR-1000X Wireless Headphones) The data shall be saved in a CSV file with the following format (note that we're adding the last column): Title Content Date Author URL Category Initially tag at least 20 articles per category, including samples that belong to more than one category, tag them accordingly. 3- Create the classifier Register a free account at http://www.monkeylearn.com/ Create a Classifier by clicking in the button "+ Create Module" on the top bar. Step 1/3 select: Name: Tech Business News, Industry Classifier Permissions: Public Module Type: Classifier Step 2/3 select: What are you working on?: Web Scraping What are you going to do?: Topic Categorization Step 3/3 select: On what kind of text?: News articles Which is your text language?: English Advanced options: Is multilabel Upload the data Click the Upload button on the Sandbox / Samples section and selecting the CSV file with the data that you gathered and tagged. Train After uploading the data, if you go to the Sandbox / Tree section, you'll see the categories uploaded in the Category Tree area. To train the classifier, just click the Train button on the right: 4 - Testing the Classifier After the model is trained, you will see a series of metrics in the Statistics area that show how well the classifier would predict new data: Accuracy Precision Recall You can actually test the classifier with particular texts by using the Classify section, just type or paste and text and click the Classify button. You will receive the corresponding prediction by the classifier. 5 - Classifier Development and Improvement In order to achieve the minimum accuracy, precision and recall required, you can iterate the process from step 2 to 4, that is: Gather more data (if required) Tag more data Upload the new data and retrain the classifier Test to see the improvement. Please read the guide How to successfully create classifiers with MonkeyLearn to learn how to perform this process. (We will provide you with this guide once accepted). Expected Results The expected results are: A working classifier (trained) with the specified categories created in MonkeyLearn platform as described. The classifier must fulfill the following requirements: At least 200 training samples per category. At least 90% of accuracy. At least 85% of precision on each category. At least 85% of recall on each category. The CSV file with the training data as specified in the step "2 - Data Tagging" . A brief summary (max two paragraphs) of the process used to gather and tag the data (tools and techniques used). The maximum number of hours to complete this task is 40hs. Deliverables - URL of the created MonkeyLearn classifier. - CSV file with the training data, named "dataset.csv". - Text file with the summary, named "summary.txt".
Skills: Data scraping Data Science
Fixed-Price - Expert ($$$) - Est. Budget: $3,500 - Posted
I am looking for someone who is very fluent in website java and making a nice GUI program that i need for certain websites and a program made for it. It will be a multithreaded program and java and python and c++ and alot of experience on how shopping websites work is needed and skilled person with website elements and scripts and a program that cant be activated and deactivated with wyday and license key and more. Need the best sneaker botdevelopment for certain things based on sneakersite Will explain most thru communication. I will also need updates as sometimes they update stuff and i would need my program updated as well and also looking for someone to teach me how to do simple updates like site keys and site tokens etc when they change time to time and fix things when the program fails to work on that day. Overall i need someone who i can trust and who is skilled in website elements backends ways and more things we can discuss and making a program thats multithreaded and fast for purposes i need. I already have few programs that gives you an example what i want so you have an idea but i need to make it better and more options added and better ways found to have better successful program for my needs adding shoes intocarts. So need a relationship also so we can continue to work together and work on other things since i will be needed few things done and willing to pay for quality program that works with great success of addingshoes on certain websitesneaker related and willing to learn things as part of the project such as finding sitekeys/find pid and earlylinks etc and working with elements but all these after we make a program first but want to keep working with a same person for future projects also since i want to have few things made . So basically i need to remake a program i already have made more efficient or find a different way thats more efficient of adding something tocart on a hyped hard release for sneakers and commitment so we can work once a week/month after the project is finished to keep updated and keep growing and upgrading if needed and off course i will pay for those times as well and i will explain more to what you are capable of doing and And the program should be able to be activated and deactivated by a license key and So if youre good at making a nice script to do certain things similar to the program i already have and you must be expericed and have good refs and honest and have time to go over things with me on weekly basis even after its made and those will be paid seperately after the program is done and paid for. I am looking for quality and on time and very easy to get in touch with everyday if needed. I need someone who is intelligent with ideas and etc. Thank you Please contact to discuss the project.
Skills: Data scraping API Development Bot Development HTML