Site Reliability EngineerWritemed • San Francisco, California, United States

Site Reliability Engineer

Writemed • San Francisco, California, United States

[job_card.variable_days_ago]

[job_preview.job_type]

[job_card.full_time]

[job_card.job_description]

About Us

Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care pathways? WriteMed.AI helps Biopharma and Life Sciences companies reduce time to write medical publications and regulatory paperwork.

Site Reliability Engineer

Location :

Atlanta, GA; Miami, FL; Cambridge, MA; San Francisco, CA; Towson, MD

Role Overview

Our technical team supports our customers’ missions with a spirit of innovation across all technologies, including AI, GenAI, LLM, Compute, Storage, Database, Big Data, Application-level Services, Networking, Serverless, Deployment, Security, and more. This is an opportunity to partner with our principal AI Architects, Data Scientists, and Engineers to maintain a robust and secure technical foundation for our customers, ranging from small Biotech companies to large Pharmaceutical firms.

Qualifications

Passionate about learning and evolving with current technological trends

Engineering degree or related technical discipline, or equivalent work experience

Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java)

Knowledge of Cloud-based applications & Containerization Technologies

Understanding of metric generation, log aggregation, time-series databases, and distributed tracing

Experience with industry standards like Terraform, Ansible

Fundamentals in Network Design, Cloud architecture, Security, or Computer Science

At least 5 years of hands-on experience in Engineering or Cloud

Minimum 5 years of experience with cloud platforms (e.g., GCP, AWS, Azure)

At least 3 years of experience in configuration and maintenance of applications or systems infrastructure for large-scale customer-facing companies

Experience with distributed system design and architecture

Responsibilities

Develop software solutions to support service delivery processes

Build and manage CI / CD pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering, and auto-remediation

Innovate relentlessly to ensure a flawless customer experience

Engage in the lifecycle of services from conception to EOL, including system design

Provide consulting and capacity planning

Define and deploy standards related to System Architecture, Service Delivery, metrics, and operational automation

Support services, product, and engineering teams with tooling and frameworks to increase availability and incident response

Improve system performance and efficiency through automation and process refinement

Collaborate with engineering teams to deliver reliable systems

Increase operational efficiency and quality by treating operational challenges as software engineering problems

Mentor junior team members and champion Site Reliability Engineering

Participate in incident response, including on-call duties

Partner with stakeholders to influence technical and business outcomes

Benefits

Comprehensive benefits supporting your personal and professional growth, including wellness programs, tuition reimbursement, expense programs, student loan repayment, childcare, and pet insurance

Inclusive culture with active employee resource groups and supportive leadership

Salary range : $140,300 to $191,550, with variations based on skills, experience, and location

Eligibility for short-term and long-term incentives as part of total compensation

#J-18808-Ljbffr

[job_alerts.create_a_job]

Site Reliability Engineer • San Francisco, California, United States

[internal_linking.similar_jobs]

Site Reliability Engineer - Platform

CodeRabbit • San Francisco, CA, United States

[job_card.full_time]

CodeRabbit is an innovative research and development company focused on building extraordinarily productive human‑machine collaboration systems. Our primary goal is to create the next generation of ...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

Attain • Redwood City, CA, United States

[job_card.full_time]

Built for consumers and companies, alike.In a world driven by data, we believe consumers and businesses can coexist.Our founders had a vision to empower consumers to leverage their greatest asset—t...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

HappyRobot • San Francisco, CA, United States

[job_card.full_time]

HappyRobot is the AI‑native operating system for the real economy—a system that closes the circuit between intelligence and action. By combining real‑time truth, specialized AI workers, and an orche...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior Site Reliability Engineer – Platform

Icon Ventures • San Francisco, CA, United States

[job_card.full_time]

At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way.We blend cognitive science with machine learning to personalize and enhance the lear...[show_more]

[last_updated.last_updated_30] • [promoted]

Lead Site Reliability Engineer

VirtualVocations • San Francisco, California, United States

[job_card.full_time]

A company is looking for a Lead Site Reliability Engineer.Key Responsibilities Leads projects focused on managing and maintaining platform infrastructure performance, reliability, and security D...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

Mercor • San Francisco, CA, United States

[job_card.full_time]

Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network ...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

WorkOS • San Francisco, CA, United States

[job_card.full_time]

WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We’re a fully distributed team with employees across...[show_more]

[last_updated.last_updated_30] • [promoted]

Staff Site Reliability Engineer

Redwood Materials, Inc. • San Francisco, CA, United States

[job_card.full_time]

Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and driving the energy transition.Founded in 2...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer (SRE)

Air Apps, Inc. • San Francisco, CA, United States

[job_card.full_time]

At Air Apps, we believe in thinking bigger—and moving faster.We’re a family-founded company on a mission to create the world’s first AI-powered Personal & Entrepreneurial Resource Planner (PRP), an...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Site Reliability Engineer

Alembic Technologies • San Francisco, CA, United States

[job_card.full_time]

Senior Site Reliability Engineer.This range is provided by Alembic Technologies.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.We’re looking fo...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

gamma.app • San Francisco, CA, United States

[job_card.full_time]

We're building the creative layer for modern communication.Every month, over a billion people make presentations — but the tools they use to make them haven't evolved in decades.We're changing that...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

Mercor, Inc. • San Francisco, CA, United States

[job_card.full_time]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

Primer • San Francisco, CA, United States

[job_card.full_time]

Primer helps B2B products break out of the B2C-centric marketing box.Our platform turns consumer ad channels, data streams, and emerging AI workflows into measurable growth engines for go-to-market...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

Plenful • San Francisco, CA, United States

[job_card.full_time]

Plenful is on a mission to transform healthcare operations from the inside out.Fresh off our recent founding round and backed by Notable Capital, Bessemer Venture Partners, TQ Ventures, Susa / Kivu V...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

Writemed • San Francisco, CA, United States

[job_card.full_time]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

Happyrobot Inc. • San Francisco, CA, United States

[job_card.full_time]

HappyRobot is the AI-native operating system for the real economy—a system that closes the circuit between intelligence and action. By combining real-time truth, specialized AI workers, and an orche...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Site Reliability Engineer

Mvp VC • San Francisco, CA, United States

[job_card.full_time]

Loft Orbital is revolutionizing access to space by building reliable, shareable satellites that drastically reduce the time and complexity traditionally required to get to orbit.We operate satellit...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

The Voleon Group • Berkeley, CA, United States

[job_card.full_time]

Voleon is a technology company that applies state‑of‑the‑art AI and machine learning techniques to real‑world problems in finance. For nearly two decades, we have led our industry and worked at the ...[show_more]

[last_updated.last_updated_variable_days] • [promoted]