Buf is building a new paradigm for APIs. Our open source and commercial software helps companies adopt Protobuf schemas throughout their stack, from gRPC APIs to streaming data in Kafka. Founded in 2020, we have raised money from leading investors, including Lightspeed, Tiger, and Addition. We’re deployed everywhere from small startups to the world’s largest enterprises, and we’d love your help growing even faster. Check out our open source projects on GitHub and join our Slack community to see what Buf is all about.
About the role
We’re looking to hire an experienced infrastructure/platform engineer to join the top-notch team that builds and operates our engineering platform at Buf. As a startup, our infrastructure team has the opportunity to work on a wide variety of problems with the goal of making it easier to build, deploy, and operate our products. Our team doesn’t treat software as a black box: we’re empowered (and encouraged) to reach in and help improve it.
This team’s responsibilities include, but are not limited to:
- Making multi-cloud configuration and management a breeze for both Google Cloud and AWS resources (e.g. Kubernetes, Helm, Crossplane, GKE, EKS, managed Postgres).
- Ensuring we have the appropriate environments to validate changes and squash bugs before they reach customers (local, review, and staging environments).
- Maintaining high availability and high release velocity through a seamless deployment and staged release process (e.g. ArgoCD) across multiple production topologies (multi-tenant, single-tenant, and on-prem).
- Enabling excellent observability by empowering engineers to create delightful dashboards and effective alerts (e.g. Prometheus, Grafana).
- Keeping nefarious actors at bay with proper security and access controls, and proving it to our customers by maintaining SOC 2 compliance (e.g. Secureframe, Okta).
You will be responsible for:
- Designing and building solutions for production-ready infrastructure systems.
- Understanding the technical, functional, and business needs of our platform.
- Developing systems that support the security, integrity, availability, and performance of our products.
- Supporting incident management activities that ensure high availability and reduced mean time to recovery (MTTR).
- Handling 1-week on-call rotations (though we rarely get paged).
About you
We’re looking for someone with the following attributes:
- 5+ years of industry experience owning projects as part of a team of software engineers in areas relevant to this role.
- Demonstrated ability to shape, scope, plan, and execute work given an objective.
- Proficiency in Go, with multiple years of professional experience using it regularly.