
Senior Gen-AI Engineer
Posted 2 hours ago

This is a fully remote position, open to applicants in Romania.
Responsibilities
• Design, develop, and maintain scalable APIs and backend services using Python, FastAPI, and related frameworks.
• Construct, deploy, and enhance production-level LLM applications using providers like OpenAI and Anthropic.
• Create and implement comprehensive RAG solutions, encompassing vector databases, semantic search, retrieval optimization, and chunking methodologies.
• Develop and oversee secure, scalable MCP servers and AI infrastructure.
• Build and orchestrate multi-agent systems to automate intricate workflows and business operations.
• Generate, test, and refine prompts, agent instructions, and LLM interactions to enhance solution quality and performance.
• Use AI-assisted development tools (e.g., Claude Code, Cursor, GitHub Copilot) to speed up software delivery and improve engineering efficiency.
• Implement event-driven architectures, messaging frameworks, and real-time communication patterns.
• Monitor, troubleshoot, and optimize AI and backend systems for performance, reliability, scalability, and security.
• Collaborate with cross-functional teams to deliver cutting-edge AI solutions and establish engineering best practices.
Requirements
• Over 8 years of experience in developing APIs with Python.
• More than 2 years of experience in developing and experimenting with LLMs.
• Daily, hands-on experience with AI-assisted and agentic coding tools (e.g., Claude Code, Cursor, GitHub Copilot, autonomous coding agents).
• Extensive experience with Python, especially in constructing REST APIs using frameworks such as FastAPI.
• Strong grounding in NLP and machine learning as they pertain to LLM system development.
• Significant experience working with major LLM APIs (e.g., OpenAI, Anthropic).
• Proven experience in building, deploying, and securing MCP servers at scale.
• Knowledge of multi-agent systems and their application to complex problem-solving.
• Ability to design and implement end-to-end RAG systems: vector databases, semantic search, retrieval quality, and chunking strategies.
• Proficiency in writing prompts for diverse use cases.
• Experience shipping generative AI solutions to production at scale, not only proofs of concept.
• Expertise in server-sent events (SSE), event-driven architectures, and messaging systems.
• Strong critical and systems thinking, with a track record of debugging, optimizing, and making informed engineering decisions across complex backend systems rather than only resolving isolated issues.
• Solid understanding of security best practices for backend systems, including authentication and data protection.
• Employees have the option to work remotely.
Traffic Label Limited
ITRex Group