
Staff Software Engineer
Posted May 28

Posted May 28
This is a fully remote position, open to applicants in New York.
β’ Recognize systemic engineering challenges across our platforms and facilitate their resolution β influencing the technical backlog and immediate architecture.
β’ Suggest and validate technical solutions for issues related to scale, performance, security, or inter-team dependencies.
β’ Drive architectural decisions for complex, ambiguous, or high-risk projects.
β’ Integrate modern industry patterns β including AI/ML tools β into our technical strategy where they significantly impact outcomes.
β’ Write code, review pull requests, debug production issues, and optimize system performance β this role involves more than just theoretical discussions.
β’ Investigate our AWS infrastructure, Kubernetes workloads, and JVM-based services to uncover and resolve actual problems.
β’ Engage in our on-call rotation as a secondary escalation point for intricate engineering incidents.
β’ Assist during major incidents to aid teams in triaging, coordinating, and resolving issues β and conduct follow-up post-incident reviews that drive lasting solutions.
β’ Promote operational excellence across our engineering teams: observability, reliability, deployment practices, and the operational habits that maintain system health at scale.
β’ Collaborate with engineering teams as a technical point of contact on complex projects β ensuring sound architectural decisions are documented and do not need to be revisited.
β’ Work directly with Engineering Managers to align technical efforts with team and product priorities.
β’ Mentor engineers within our teams, elevating the technical standards through reviews, pairing, and direct feedback.
β’ Stay attuned to customer needs. Comprehend how Sailthru's platform impacts its users, incorporate that context into technical decisions, and advocate against engineering choices that create customer friction.
β’ Work closely with the product team β contribute to defining what gets built, not just how itβs built. The best technical decisions arise when engineering and product collaborate from the outset.
β’ Extensive, proven expertise in AWS and cloud-native architecture β adept at building reliable, scalable systems that handle real traffic volumes.
β’ Strong foundational knowledge of JVM languages, primarily Java and Kotlin, with the flexibility to work across languages as required by the problem at hand.
β’ Practical experience with Kubernetes β deploying, managing, and troubleshooting workloads in a production environment.
β’ Background in designing and managing distributed systems β including fault tolerance, consistency trade-offs, service coordination, and failure modes at scale.
β’ Comfort working with large-scale databases β including query optimization, schema design, operational considerations, and trade-offs between data storage methodologies.
β’ Hands-on experience with AI/ML engineering β whether it involves integrating LLM capabilities, constructing pipelines, or assessing where AI genuinely adds value versus complicating processes.
β’ A proven history of successfully executing large, complex projects in ambiguous settings β scoping, driving consensus, and delivering results.
β’ Strong debugging instincts with a preference for root-cause analysis over temporary fixes.
β’ The insight to discern when to advocate for the ideal solution and when to opt for a more pragmatic approach.
β’ Proficiency with AI-driven development tools β such as Claude Code, GitHub Copilot, Codex, or similar β is essential. We use these tools daily and expect staff engineers to utilize them effectively and model best practices around prompt discipline, output validation, and security.
β’ Ability to operate without a predefined playbook β you have tackled novel challenges before and know how to navigate uncertainty.
β’ Comfort working with legacy codebases β capable of reading unfamiliar code, understanding older systems, and making significant improvements without necessitating a complete rewrite.
β’ The ability to swiftly analyze an existing architecture β identifying bottlenecks, constraints, and opportunities β and converting that understanding into specific, actionable proposals that enhance team velocity.
β’ You think expansively. Youβre not limited by current practices β you can envision substantially better outcomes and work backwards to create a plan.
β’ Proactive approach. You prefer to act with imperfect information and adjust as necessary rather than waiting for certainty that may never arrive. You recognize the difference between reversible decisions and those that require more deliberation.
β’ Unlimited PTO
β’ Excellent medical, dental, and vision coverage
β’ Employee Equity and Stock Purchase Plan
β’ Employee Discounts, Virtual Wellness Classes, and Pet Insurance
β’ And more!!
Truelogic Software
Index Analytics LLC
ClickHouse
Nordson Corporation
Get handpicked remote jobs straight to your inbox weekly.