
Senior SRE Engineer
Posted May 21

Posted May 21
This is a fully remote position, open to applicants in Brazil.
• Ensure system reliability and availability by architecting, designing, and implementing strategies that promote high availability, reliability, and fault tolerance for our infrastructure and applications.
• Oversee incident management by leading response efforts, diagnosing root causes, and executing preventative measures to reduce the likelihood of future incidents.
• Optimize performance by identifying bottlenecks, conducting performance analyses, and enhancing system and application efficiency.
• Drive automation by initiating projects and developing as well as maintaining tools, scripts, and frameworks to facilitate deployment, monitoring, and troubleshooting processes.
• Create periodic reports on system reliability, uptime, and performance metrics to offer insights and visibility to stakeholders.
• Work in collaboration with cross-functional teams to establish key performance indicators (KPIs) and develop reporting frameworks to monitor and assess system health and operational efficiency.
• Compile executive-level reports that summarize incidents, their resolutions, and suggestions for improvements.
• Share findings, trends, and recommendations with management and stakeholders, delivering actionable insights to enhance decision-making processes.
• Bachelor's degree in Computer Science or a related discipline.
• Proven experience in constructing SLIs, SLOs, and error budgets based on business requirements.
• Background in IT project management.
• Proficient in coding with Python, Java, Shell, Bash, or similar programming languages.
• Experience in supporting critical production services both in the cloud (AWS) and on-premises.
• Familiarity with network technologies and expertise in system, security, and network monitoring tools.
• Comprehensive technical knowledge of databases and the Linux operating system, including standards and best practices for ensuring service availability.
• Demonstrated proactive approach in identifying issues and opportunities for enhancement, automating processes through code, and addressing performance challenges programmatically.
• Advanced proficiency in English.
• Bradesco health and dental plan for you and your dependents, with no co-pay.
• Life insurance with enhanced coverage.
• Meal voucher and supermarket voucher.
• Home office allowance.
• Gympass — access to gyms and online classes.
• Trustly Club — discounts at educational institutions and partner stores.
• English program — online group classes with a dedicated teacher.
• Extended maternity and paternity leave.
• Birthday off.
• Flexible hours / remote-first culture — you can work from any city in Brazil.
• Welcome kit — we provide Apple equipment (MacBook Pro, iPhone) and additional perks; equipment may be purchased by employees under internal criteria.
• Annual bonus — eligibility for a discretionary annual premium based on company KPI achievement.
• Referral program — receive a reward if a referred candidate is hired.
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.