Overview
FriendliAI, based in San Mateo, CA, is building the next-generation AI inference platform that powers large language and multimodal models with unmatched performance and usability. Our infrastructure delivers high-throughput, low-latency AI workloads for organizations worldwide, and we integrate seamlessly with Hugging Face, giving instant access to over 440,000 open-source models.
We are on a mission to deliver the world's best platform for generative and agentic AI.
The Role
We're seeking a Backend Engineer to design, build, and scale our web platform, which serves as the core interface for deploying multimodal models, observing workloads, and building agent workflows. You'll collaborate closely with product and infrastructure teams to create high-performance, developer-friendly, and enterprise-ready tools.
The Person
We are seeking a hands-on, talented backend developer who is eager to work at the intersection of infrastructure, developer experience, and AI applications. A great candidate is a strong collaborator who enjoys working across the stack, cares deeply about developer workflows, and wants to help define the future of AI adoption.
Key Responsibilities
- Design, build, and maintain web applications and tools for AI model deployment, monitoring, and performance optimization
- Develop clean, scalable, and robust APIs powering AI agents, workflows, and user-facing systems
- Collaborate with infrastructure engineers to integrate backend systems with deployment and orchestration pipelines
- Drive code quality through automated testing, CI/CD, and code reviews
- Contribute to architecture and design decisions that shape our platform's long-term direction
- Identify and resolve technical debt and improve the reliability of production systems
Qualifications
- 5+ years of industry experience in full-stack or backend engineering
- Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent
- Strong backend experience with FastAPI or similar Python frameworks
- Proficiency in designing data models, writing SQL, and working with PostgreSQL
- Deep understanding of modern web frameworks and component-driven architecture
- Strong API design experience across gRPC / REST / GraphQL in production systems
- Solid foundation in cloud-native development
- Familiarity with OpenTelemetry tracing, metrics, and structured logging
- Knowledge of web security, authentication, RBAC, and multi-tenant SaaS systems
Preferred Experience
- Familiarity with LLM-based workflows, tool invocation, or agentic systems
- Familiarity with Kubernetes for container orchestration, including deploying, scaling, and managing containerized applications in production environments
- Experience working in a startup or other fast-paced environment with ownership and ambiguity
- Passion for developer experience and enabling AI adoption
Benefits
- Daily lunch and dinner provided
- Unlimited snacks and beverages
- Supportive work environment
We offer competitive compensation, startup equity, health insurance, and other benefits.