Mixture Posture: SRE is Intriguing

Thursday, March 24, 2022

SRE is Intriguing

As my interest in Business Continuity Planning (BCP) and Disaster Recovery (DR) evolves, I find myself wondering what life would have been like if I had, in the past, transitioned to being a Site Reliability Engineer. No, I'm definitely not qualified for the position below!

Site Reliability Engineer, Trello

SRE | New York, United States | Remote Americas | Full Time

Atlassian can hire people in any country where we have a legal entity, assuming candidates have eligible working rights and a sufficient timezone overlap with their team. As our offices re-open, Atlassians can choose to work remotely or return to an office, unless it’s necessary for the role to be performed in the office. Interviews and onboarding are conducted virtually, a part of being a distributed-first company.
As a Site Reliability Engineer at Trello, you’ll work on keeping everything running efficiently as we scale our infrastructure to support our more than 90 million Trello users while maintaining our 99.99% uptime target. The code you write and deploy into production will directly contribute to the scalability and resilience of Trello, and you will directly improve our users’ experience with the product.
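
An aside from me, not from the listing: to get a feel for what a 99.99% uptime target allows, here is a minimal sketch of the arithmetic. It assumes a flat 30-day month and a 365-day year and counts every minute equally; the function name allowed_downtime_minutes is my own, and the listing does not say how Trello actually measures uptime.

```python
# Back-of-the-envelope downtime budget for an availability target.
# Assumption: uptime is measured over plain calendar time, with no
# exclusions for planned maintenance.
MINUTES_PER_DAY = 24 * 60


def allowed_downtime_minutes(target_percent: float, days: int) -> float:
    """Return the minutes of downtime permitted over `days` at the given uptime target."""
    unavailable_fraction = 1 - target_percent / 100
    return unavailable_fraction * days * MINUTES_PER_DAY


if __name__ == "__main__":
    for label, days in [("30-day month", 30), ("365-day year", 365)]:
        minutes = allowed_downtime_minutes(99.99, days)
        print(f"{label}: {minutes:.2f} minutes of downtime allowed")
```

Under those assumptions, 99.99% works out to roughly 4.3 minutes of downtime in a month and about 52.6 minutes in a year, which is not much room for a slow incident response.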

Key Responsibilities:

Comfortable owning the infrastructure and pragmatically solving problems dealing with complex systems.
Working with developers to support the latest features that we have in development, like Power-Ups, data pipeline improvements, and scalable microservices.
Contributing your insights across the team to help us improve or re-architect existing systems for scale and extensibility.
Contributing to new and existing compliance initiatives.

On your first day, you will have expertise in:

Engineering microservices and tools across one or more programming languages (e.g. Go, Python).
Automation and Infrastructure-as-Code projects and tooling (e.g. Ansible, Puppet, Terraform).
Building and maintaining a continuous integration and delivery pipeline (e.g. Bamboo, Bitbucket Pipelines, GitHub Actions).
Observability tools and methodology (e.g. logging, metrics, tracing) for highly available web services.
Designing and delivering AWS cloud-native infrastructure solutions.
Incident response and management in on-call rotation.
Focus on operational maturity and reliability with microservices.

It's great, but not required, to have:

Experience building and managing large scale, high impact systems on AWS or other cloud infrastructure.
Experience with large MongoDB cluster deployments.

More about our benefits

Whether you work in an office or a distributed team, Atlassian is highly collaborative and yes, fun! To support you at work (and play) we offer some fantastic perks: ample time off to relax and recharge, flexible working options, five paid volunteer days a year for your favourite cause, an annual allowance to support your learning & growth, unique ShipIt days, a company paid trip after five years and lots more.

More about Atlassian

Creating software that empowers everyone from small startups to the who’s who of tech is why we’re here. We build tools like Jira, Confluence, Bitbucket, and Trello to help teams across the world become more nimble, creative, and aligned; collaboration is the heart of every product we dream of at Atlassian. From Amsterdam and Austin, to Sydney and San Francisco, we’re looking for people who want to write the future and who believe that we can accomplish so much more together than apart. At Atlassian, we’re committed to an environment where everyone has the autonomy and freedom to thrive, as well as the support of like-minded colleagues who are motivated by a common goal to: Unleash the potential of every team.
