Data Scientist required to automate data acquisition, localization research, and presentation for a proof-of-concept project.
In this role, you will use the urllib2 library to connect to a website, the BeautifulSoup library to collect HTML, and the re (regex) library to parse words and filter out markup. Parsed content will be stored in an open source database of your choice. We provide the development resources required to implement the project.
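For candidates who want a concrete picture of the workflow, here is a minimal sketch of the fetch, parse, and store steps. It is an illustration under stated assumptions, not the project's actual code. It uses Python 3's urllib.request in place of urllib2, the standard-library html.parser as a stand-in for BeautifulSoup so that it runs without third-party packages, and SQLite as one possible open source database. The names TextExtractor, fetch_html, extract_words, store_words, the word_counts table, and the example.com URL are all hypothetical.

```python
import re
import sqlite3
from collections import Counter
from html.parser import HTMLParser
from urllib.request import urlopen  # Python 3 successor to urllib2.urlopen


class TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>.

    Stand-in for BeautifulSoup so the sketch needs only the standard library.
    """

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def fetch_html(url, timeout=10):
    """Download a page; UTF-8 is an assumed encoding, not a detected one."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def extract_words(html):
    """Strip markup, then use a regular expression to pull out lowercase words."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    # Letters, optionally followed by one apostrophe-joined suffix ("it's").
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text)


def store_words(conn, url, words):
    """Write per-page word counts into a hypothetical word_counts table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS word_counts "
        "(url TEXT, word TEXT, occurrences INTEGER, PRIMARY KEY (url, word))"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO word_counts (url, word, occurrences) "
        "VALUES (?, ?, ?)",
        [(url, word, n) for word, n in Counter(words).items()],
    )
    conn.commit()


if __name__ == "__main__":
    # Offline demo on an inline page; swap in fetch_html(url) for a live site.
    page = "<html><body><h1>Hello World</h1><p>Hello again.</p></body></html>"
    conn = sqlite3.connect(":memory:")
    store_words(conn, "http://example.com/", extract_words(page))
    for row in conn.execute(
        "SELECT word, occurrences FROM word_counts ORDER BY word"
    ):
        print(row)
```

The demo parses an inline HTML string so it runs without network access. In practice the page would come from fetch_html, and the database could be any open source engine with a Python driver.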
Comfortable writing regular expressions (RegEx).
Screen scraping web content using cURL, wget, OCR, urllib2, re, BeautifulSoup, XPath, Selenium, Splinter.