Remotery

Web Scraping Specialist

atMLabsUS flagNew YorkFull-timeUncategorizedMid-levelSenior$75k – $100k/year

Posted 6 days ago

This is a fully remote position, open to applicants in New York.

📋 Description

• Code Development: Develop, test, and enhance high-performance code to extract data from diverse online sources, ensuring optimal reliability and efficiency.

• Data Retrieval: Oversee intricate data retrieval operations, including managing pagination and dynamic content loading via AJAX.

• Data Quality: Cleanse and format the extracted data to guarantee it adheres to stringent quality standards for subsequent analysis and processing.

• Database Management: Store and oversee scraped data in suitable databases, optimizing for both access speed and long-term data integrity.

• Monitoring and Maintenance: Continuously supervise scraping processes and infrastructure to detect and rectify issues, ensuring a steady and uninterrupted data flow.


⛳️ Requirements

• Extraction Expertise: Proven capability to extract data from complex websites with minimal oversight, backed by a portfolio of previous projects.

• Technical Proficiency: Advanced knowledge of Python or JavaScript, particularly with libraries and frameworks such as BeautifulSoup, Scrapy, or Selenium.

• Advanced Programming: Strong understanding of asynchronous programming, multithreading, and distributed scraping architectures.

• Web Fundamentals: Comprehensive knowledge of HTML, CSS, JavaScript, and the Document Object Model (DOM).

• Data Storage: Familiarity with NoSQL databases (e.g., MongoDB, Cassandra), including the ability to design efficient storage solutions.

• Cloud Infrastructure: Experience in deploying and managing large-scale scraping tasks using cloud services such as AWS, Google Cloud, or Azure.

• Preferred Skills: Capability to implement machine learning algorithms for data cleaning, categorization, or predictive analysis; active involvement in relevant open-source projects.


🏝️ Benefits

• Competitive Compensation: A highly competitive salary ranging from **$75,000 to $100,000**, complemented by a comprehensive benefits and equity package.

• Impactful Work: The opportunity to work at the forefront of AI development and web-scale knowledge graph creation.

• High-Output Culture: A professional environment that prioritizes low ego, technical autonomy, and rapid execution.

• Remote Flexibility: This is a remote position requiring a 6-hour overlap with the core team's schedule.

People also viewed

Digital Federal Credit Union3 hours ago

Information Center Loan Specialist I

US flagNew Hampshire, +1 more stateFull-timeUncategorized$21 – $24/hour
ApplyView job
UMS SKELDAR3 hours ago

Pilot to UMS Skeldar

SE flagSweden OnlyFull-timeUncategorized
ApplyView job
Lucet3 hours ago

Advanced Practice Telehealth Provider – Nurse Practitioner / Physician Assistant

US flagNew York OnlyFull-timeUncategorized$115k – $130k/year
ApplyView job
NJM Insurance Group4 hours ago

Managed Care Coordinator I

US flagNew Jersey OnlyFull-timeUncategorized$44.6k – $59.7k/year
ApplyView job
Hunt St4 hours ago

Electrical Estimator

PH flagPhilippines OnlyFreelanceUncategorized$1,800 – $2,200/month
ApplyView job
VF Corporation5 hours ago

Field Service Representative

US flagFlorida OnlyFull-timeUncategorized$60k – $75k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers