Senior Database Reliability Engineer

Gridware
San Francisco, CA · Hybrid · $185,000 - $205,000 · Feb 5, 2026 · Posted 2 months ago
Tech Stack

PostgreSQL, MySQL, Amazon RDS, Amazon Aurora, Kafka, Grafana, Prometheus, Terraform, Ansible, Redshift, Snowflake, DynamoDB, MongoDB, Kubernetes

Must-Have Requirements

  • 5+ years of production relational database management
  • 5+ years of cloud infrastructure experience
  • Hands-on experience with PostgreSQL, MySQL, Amazon RDS/Aurora, or similar
  • Proficiency in monitoring and observability for databases, streaming, and cloud infrastructure
  • Strong troubleshooting skills for complex production systems
  • Database and infrastructure security, access control, and compliance knowledge
  • Ability to collaborate across engineering, DevOps, and Data teams

Nice to Have

  • Experience with analytical or NoSQL databases (Redshift, Snowflake, DynamoDB, MongoDB)
  • Containerized deployments and Kubernetes-based operators
  • Event-driven architecture and distributed system troubleshooting experience
  • Kafka infrastructure and streaming pipeline management
  • DevOps practices, automation, and Infrastructure as Code (Terraform, Ansible)

Description

About Gridware

Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. We pioneered a groundbreaking new class of grid management called active grid response (AGR), focused on monitoring the electrical, physical, and environmental aspects of the grid that affect reliability and safety. Gridware’s advanced Active Grid Response platform uses high-precision sensors to detect potential issues early, enabling proactive maintenance and fault mitigation. This comprehensive approach helps improve safety, reduce outages, and ensure the grid operates efficiently. The company is backed by climate-tech and Silicon Valley investors. For more information, please visit www.Gridware.io.

Role Description

We are seeking a Database Reliability Engineer to own and maintain Gridware’s relational databases, cloud infrastructure, and streaming platforms. This role combines traditional DBA responsibilities (ensuring the high availability, performance, data integrity, and security of databases) with infrastructure ownership, including setup and management of Kafka-based streaming pipelines, DevOps automation, and cloud platform management.

You will work closely with Data Engineering, Site Reliability, and DevOps teams to proactively monitor, troubleshoot, and optimize all critical infrastructure, enabling rapid deployment of new features while ensuring reliability and data integrity.

Responsibilities

  • Administer, monitor, and optimize relational databases (PostgreSQL, Amazon RDS) for performance, availability, and security.
  • Troubleshoot complex database and infrastructure issues, including query performance, replication, schema evolution, and event streaming pipelines.
  • Maintain and support Kafka infrastructure for company-wide streaming pipelines and integration with databases.
  • Implement backup, restore, and disaster recovery strategies for databases and streaming platforms.
  • Collaborate with DevOps and Data Engineering teams to maintain CI/CD pipelines for schema, data, and infrastructure changes.
  • Enforce database and infrastructure best practices, standards, and security policies.
  • Proactively monitor health and performance of databases, streaming pipelines, and cloud infrastructure using Grafana, Prometheus, or equivalent.
  • Contribute to Infrastructure as Code (Terraform, Ansible) for database, Kafka, and cloud infrastructure provisioning and management.
  • Support internal teams during incidents or urgent troubleshooting, balancing reliability with rapid deployment needs.

Required Skills

  • 5+ years of experience managing production relational databases and cloud infrastructure.
  • Hands-on experience with PostgreSQL, MySQL, Amazon RDS/Aurora, or similar.
  • Proficiency in monitoring and observability for databases, streaming, and cloud infrastructure.
  • Strong troubleshooting skills for complex, multi-layered production systems.
  • Knowledge of database and infrastructure security, access control, and compliance best practices.
  • Ability to collaborate across engineering, DevOps, and Data teams.

Bonus Skills

  • Experience with analytical or NoSQL databases (Redshift, Snowflake, DynamoDB, MongoDB).
  • Containerized deployments and Kubernetes-based operators for databases or Kafka.
  • Event-driven architecture experience and distributed system troubleshooting.
  • Experience managing Kafka infrastructure and supporting streaming pipelines.
  • Familiarity with DevOps practices, automation, and Infrastructure as Code (Terraform, Ansible, or similar).