
Software Engineer, Data Infrastructure
Posted 6 days ago

This is a fully remote position, open to applicants in New York.
• Develop and uphold a high-performance data layer that our Modeling teams depend on for their training and evaluation tasks.
• Engage directly with petabyte-scale storage systems, along with the networking and performance challenges associated with them.
• Collaborate on a daily basis with top-tier researchers and engineers who excel in their respective fields.
• Over 4 years of experience in data storage infrastructure.
• Proficient in Python programming.
• Experience with Kubernetes, particularly in storage aspects (Persistent Volumes, CSI drivers, etc.).
• Ability to convert unstructured data into efficient datasets across various storage backends, including S3, GCS, and POSIX.
• Familiarity with distributed data processing frameworks such as Apache Beam, Spark, or Flink.
• [Nice-to-have] Knowledge of modern analytics tools like BigQuery, Airflow, or dbt.
• A genuine enthusiasm for AI.
• Comfort in navigating the unknown, coupled with a desire to create something truly innovative rather than just refining existing solutions.
• An open and inclusive culture and work environment.
• Collaborate closely with a team at the forefront of AI research.
• Weekly lunch stipend, along with in-office lunches and snacks.
• Comprehensive health and dental benefits, including a dedicated budget for mental health support.
• 100% Parental Leave top-up for a maximum of 6 months.
• Personal enrichment benefits for arts and culture, fitness and well-being, quality time, and workspace enhancements.
• Flexible remote work options, with offices located in Toronto, New York, San Francisco, London, and Paris, as well as a co-working stipend.
• 6 weeks of vacation (30 working days!).