
Distributed Systems Engineer
Posted 1 hour ago

This is a fully remote position, open to applicants in the United States.
• Design and develop distributed systems that tolerate high latency, limited bandwidth, and intermittent connectivity.
• Implement fault-tolerant communication strategies, including retry logic, backpressure, caching, and eventual consistency patterns.
• Write maintainable, resilient, and thoroughly tested code in accordance with development standards and methodologies.
• Troubleshoot and enhance system performance, addressing networking and distributed coordination challenges.
• Contribute to systems primarily developed in Rust.
• Engage with system-level issues such as scheduling, memory management, I/O optimization, storage hierarchy management, and system reliability.
• Enhance performance and optimize memory usage in resource-constrained environments.
• Address concurrency issues and challenges related to distributed coordination.
• Design and manage APIs and communication layers for distributed components.
• Identify and mitigate tight coupling between services and systems.
• Diagnose and rectify cross-system failures in production settings.
• Create and implement secure, reliable solutions that align with engineering standards.
• Collaborate with engineers and computer scientists on operating systems internals, compiler design, fault tolerance, file system architecture, and trusted systems.
• Assist in resolving architectural and systemic challenges.
• Continue to enhance expertise in distributed systems, HPC infrastructure, and associated tooling.
• Proven experience in building distributed systems within environments characterized by low bandwidth, high latency, or unreliable communication links.
• Hands-on experience in developing systems using Rust or Go in production settings.
• Knowledge of failure modes in distributed systems and effective mitigation strategies.
• Familiarity with consistency models, coordination strategies, and state replication.
• Experience in designing APIs and communication layers for distributed components.
• Background in working within established architectures and delivering components of production quality.
• Understanding of systems-level concepts such as durability, reliability, and operational behavior.
• Ability to work independently while effectively collaborating with technical leadership.
Preferred Qualifications
• Experience with HPC environments, exascale computing, or AI/ML infrastructure.
• Exposure to operating systems internals, compiler design, or language runtimes.
• Experience with edge computing or environments with constrained networks.
• Familiarity with message queues, event-driven systems, or streaming architectures.
• Exposure to consensus algorithms or distributed coordination primitives.
• Expertise in concurrency, memory management, or performance optimization within production systems.
• Experience contributing to developer tooling, internal platforms, or infrastructure-layer components.
• Remote work opportunities available within the United States.
• Opportunity to work on distributed systems projects that support AI and exascale workloads.
• Collaboration with engineers proficient in operating systems internals, compiler design, fault tolerance, file system architecture, and trusted systems.
• Exposure to AI infrastructure, HPC, and large-scale distributed computing environments.