Title: Site Reliability Engineer
Location: Sunnyvale, CA
Contract duration: 6+ months, with extension
Work on the monitoring stack and provide support to the engineering team.
Intermediate-level Python scripting is highly preferred; experience with another programming language is also acceptable.
Experience with Grafana, Riemann, or other monitoring tools.
- Supporting Apigee's existing monitoring and alerting stack.
- Metrics visualization, dashboards, and APIs.
- Distributed metrics storage and querying for OSS InfluxDB.
- Anomaly detection in system metrics.
Python, InfluxDB, Grafana, Riemann
- Minimum of 2 years of experience in a similar role focused on monitoring and alerting tools infrastructure.
- Experience as a site reliability engineer.
- Must have at least intermediate-level Python programming skills for writing scripts and developing tools.