Data Mining Jobs

479 were found based on your criteria {{ paging.total|number:0 }} were found based on your criteria

show all
  • Hourly ({{ jobTypeController.getFacetCount("0")|number:0}})
  • Fixed Price ({{ jobTypeController.getFacetCount("1")|number:0}})
Fixed-Price - Entry Level ($) - Est. Budget: $20 - Posted
I would like to create a medical reference guide of about 1000-3000 terms. I am compiling the terms and definitions from multiple websites/resources. What I need help with is to transcribe these tables from websites into word or excel. These terms will be organized into different categories, which I will make clear where they go. Additionally, you will have to make sure there are no repeats before you enter the term. Budget: $3 per 500 terms or $20 whichever comes first You will be paid based on terms inputted that are actually used. Terms may be removed for repetition or lack of usefulness.
Skills: Data mining Data Entry
Fixed-Price - Intermediate ($$) - Est. Budget: $450 - Posted
In internet there is a lot of paralell corpora like: http://opus.lingfil.uu.se/Wikipedia.php But this data as you can see is very noisy there are wrong translations in it, very poor, to other languages, some isbn codes, number, etc. I need automatic filtering tool that would remove poor data (with minimal loss possible) to provide parallel texts with quality simillar to: http://opus.lingfil.uu.se/OpenSubtitles2016.php The tool should be language independent. Finding best method will be some job for you but here is some tool that works good but far too slow, looses a lot of good data and requires a lot of manual parapeter adaptation: https://github.com/krzwolk/Text-Corpora-Adaptation-Tool/commits?author=krzwolk and it also doest not necessary filter data but selects in-topic-domian data. Notheless I belevie that methods used there (levenstein distance, perplexity, td-idf) may be useful. Most importantly in one step in data filtration I would like to use comparison to n-gram langauge model. https://en.wikipedia.org/wiki/N-gram I can provide language models, but I would need the tool to also using them to analyse if such sentence can be present is a langauge or should be filtered out. The tool should work under linux - programming langauge is not importantn to me
Skills: Data mining C C++ Data Analytics
Fixed-Price - Intermediate ($$) - Est. Budget: $200 - Posted
Please make sure you are applying to this job for full time developer and don't send any generic proposal as well. We need a full time web crawling,development,database management expert to do multiple works so please apply if you can provide more than 8 hours a day or more,if needed.We are looking for few people to be a part of our existing team and to win and grow your future skills as well. Requirement:- Crawling,Stack,Angular,.Net.Java,Data Scientist,Scraping,Data Mining,Management,Analytics,Wordpress,Joomla,Magento,experts can apply to this Job other skill will be ignore. Timing:- Should be in Indian Hours and Candidate should be able to reply fast and complete work before the deadline or On-time but if you are not able to provide any kind of above requirement then please don't bid. Hours: 20 hours or more per week. Price:- $5-10 Hours according to the skills. Long-term opportunity for perfect candidates. Thanks Vivek
Skills: Data mining Web Crawling Data Science Data scraping
Hourly - Entry Level ($) - Est. Time: Less than 1 month, Less than 10 hrs/week - Posted
We want database of Name, Age, Address, Phone number and email who are looking for medical tourism and Ayurvedic treatments in India. This data need to be got off from the Ayurved related websites and put into an excel spreadsheet. We are looking for at least 500 emails. You can provide data from different countries like U.S., U.K., U.A.E., Australia etc.
Skills: Data mining Lead generation Market research
Fixed-Price - Intermediate ($$) - Est. Budget: $500 - Posted
We are looking to extract hotel Income Statement / P&L data provided in PDF files (often times in inconsistent formats and layouts) into a consistent usable format that would allow them to be aggregated. - Attached P&Ls are examples of the variation in the format/layout the data comes from. In some cases, several years are on one page, in other cases, there is one PDF for each year. The way data is laid out varies significantly from one hotel to another. - While we have a number of files from which the initial batch of data should be extracted from, the system should be flexible enough to accommodate new layouts. - The attached Excel file is a template into which the extracted data should be saved or ideally, it could be saved directly into a database format, with each observation on its own row. - Since the naming conventions for certain line items are not always consistent across different hotel, the logic would have to accommodate that - e.g., both energy and electricity line items should be included under "Utilities / Energy" line item. - There are also intricacies related to zero vs. null to address to ensure that any aggregation properly reflects the underlying data. - We expect to have a workflow/tool in place that would allow us to do this data extraction on an ongoing basis on relatively large amounts of data (~50-100 P&Ls weekly). - The posted budget is a placeholder and the fixed-budget is up for negotiation based on better specification of the scope and deliverables.
Skills: Data mining Data scraping SQL
Hourly - Intermediate ($$) - Est. Time: Less than 1 week, 30+ hrs/week - Posted
Web Researcher Needed! Search the web: Find list of media contacts -- editors, journalists from newspapers and publications in Massachusetts (Boston, South Shore and Cape Cod) Need about 200 to 300 email addresses. Use your Data.com, Hoovers, ZoomInfo, etc account. Need done in one day. Will pay .05 to .10 per valid email.
Skills: Data mining Internet research Lead generation
Fixed-Price - Entry Level ($) - Est. Budget: $20 - Posted
Hi, I need 2 people who can research well and can call to UK. I need my excel list to be completed with contact information. Some are not available online so you have to call the establishment to get the information. This is urgently needed. There are only about 70-80 establishments left in the list. I will hire 2 people to part this so 1 can do about 40. Josefa,
Skills: Data mining Call Handling Internet research Microsoft Excel
Hourly - Entry Level ($) - Est. Time: Less than 1 week, 10-30 hrs/week - Posted
Do you have experienced data mining text and parsing through mountains of data? I am looking for someone who can help customize some prototypes which I am working on for a client. You must have advanced experience with rapid miner which is an open source tool. Excel, python, and advanced sql skills are extremely helpful Please reach out. many thanks, Paul
Skills: Data mining Rapid Miner SQL