Reliability [h1.location_city]
[job_alerts.create_a_job]
Reliability • san mateo ca
- [promoted]
Senior Site Reliability Engineer Cloud Platform
ZillizRedwood City, CA, US- [promoted]
Senior Site Reliability Engineer
IXL LearningSan Mateo, CA, US- [promoted]
Software Engineer, Site Reliability
RobloxSan Mateo, California, United StatesLead Software Engineer- Middleware Reliability Engineering
VisaFoster, California, USAStaff Site Reliability Engineer
SolarWindsSan Mateo, California- [promoted]
Reliability Technician
Mentor Technical GroupMillbrae, CA, US- [promoted]
Reliability Engineer
PeriodiclabsMenlo Park, CA, United States- [promoted]
Sr Reliability Engineer
Hippocratic AISan Mateo, CA, USSite Reliability Engineer Intern
OracleRedwood City, California, USA- [promoted]
AI Evaluation & Reliability Engineer
Saxon GlobalRedwood City, CA, United States- [promoted]
Senior SRE Engineer - Reliability & Scale
Roblox CorporationSan Mateo, CA, United States- [promoted]
Site Reliability Engineer
ZooxFoster City, CA, US- [promoted]
Senior Reliability Engineer, South Campus - Site Services
GenentechSouth San Francisco, CA, United States- [promoted]
Reliability Engineer
Robust.aiSan Carlos, CA, US- [promoted]
Senior Hardware Reliability Engineer - Design for Reliability
ZiplineSouth San Francisco, CA, US- [promoted]
Staff SRE (Reliability Engineering)
PowerToFlyRedwood City, CA, United States- [promoted]
Reliability Engineer
Periodic LabsMenlo Park, CA, United States- [promoted]
Senior Site Reliability Engineer
CaptivateIQMenlo Park, CA, USSenior Site Reliability Engineer
IXLSan Mateo, CASenior Site Reliability Engineer Cloud Platform
ZillizRedwood City, CA, US- [job_card.full_time]
Job Description
Job Description
Zilliz is a fast-growing startup developing the industry’s leading vector database company for enterprise-grade AI. Founded by the engineers behind Milvus, the world’s most popular open-source vector database, the company builds next-generation database technologies to help organizations quickly create AI applications. On a mission to democratize AI, Zilliz is committed to simplifying data management for AI applications and making vector databases accessible to every organization.
What you will do :
- Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms.
- Ensure the reliability, availability, and performance of Zilliz’s distributed database systems.
- Develop and implement strategies for monitoring, incident management, and disaster recovery.
- Automate system operations and maintenance tasks to improve efficiency and reduce manual intervention.
- Design and build tools to manage and monitor infrastructure, ensuring scalability and robustness.
- Collaborate with software engineers to enhance system reliability, scalability, and performance.
- Maintain and improve the CI / CD pipeline to ensure smooth and rapid deployment of changes.
- Actively contribute to the Milvus Vector Database open-source community, focusing on improving reliability and operational efficiency.
What we are looking for :
Zilliz is an Equal Opportunity Employer and welcome people from all backgrounds, experiences, abilities, and perspectives. All qualified applicants will receive consideration for employment regardless of race, color, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.