Sr. Software Engineer, Site Reliability Engineering

Job Function
Information Technology
Position Type
Experienced Professionals
Requisition ID
2025-18990
Posting Location : Location
US-GA-Atlanta

Overview

At Chick-fil-A, Site Reliability Engineering is a technical function which mixes in influence. Across our 3000+ North American stores, cloud, and private data centers, SREs work with our DevOps teams to introduce and hone SRE principles, establish reliability goals, and develop tooling for operational observability. We are a small team working through many different patterns to bring observability to everyone. SREs at Chick-fil-A collaborate across teams and roles, feed learnings back into the organization, and learn all the ways technology is used along the way. The team is focused on tooling and enablement rather than traditional SRE roles.

 

Our Flexible Future model offers a healthy mix of working in person and virtually, strengthening key elements of the Chick-fil-A culture by fostering collaboration and community.

Responsibilities

  • Work independently with DevOps teams to refine running production systems
    • Building on-call processes
    • Creating Incident Management and Response procedures
    • Instrumenting for observability and coaching on best practices
    • Monitoring SLIs
  • Work to varying degrees with DevOps teams
    • Provide consultation on SRE best practices
    • Give guidance on specific topics
    • Oversee groups of dedicated engineers
    • Embed directly with teams
  • Work with teams to define SLOs and error budgets
  • Ensure services and systems meet availability needs of customers
  • Document learnings to share with the broader engineering teams
  • Ensure clear communication around SRE objectives
  • Collaborate broadly across the entire engineering organization
  • Oversee other SREs to bring best practices or learnings from across the organization to them
  • Build internal tooling around operational observability
  • Bring a strong mindset of continual improvement
  • An aversion to toil and automatable tasks
  • Advocate for SRE as a part of engineering culture
  • Act as a conduit for Architecture, Security, Developer Experience, and Common Engineering
  • Keep abreast of industry changes and evaluate for implementation
  • Design and develop software solutions 
  • Serve as a model developer in programming languages like Java, Go, Python, and Python
  • Exercise skills in infrastructure and deployment services like AWS and Kubernetes as well as areas like application security, data analytics, and machine learning

 

Knowledge, skills, and abilities: 

  • Building and supporting systems
  • Enterprise cloud providers
  • Production containerized environments
  • Experience with CI/CD pipelines
  • Investigating system issues and designing solutions
  • Identifying areas for automation or tooling
  • Good interpersonal and team collaboration skills
  • Excellent written and verbal communication skills 
  • Strong analytical and problem-solving skills  
  • Ability to build strong relationships, collaborate, and influence diverse groups of engineers and non-technical roles  
  • Ability to influence other engineers without organizational authority
  • Demonstrated ability to positively influence the engineering culture and practices in a professional environment

 

Note - Working in a DevOps model, this opportunity includes both building and running solutions that could require off hours support. This support is shared amongst the team members to cover weekends and weeknights.  The goal is to design for failure and, using cloud-native infrastructure patterns, automate responses to possible issues so they can be worked during normal hours.  

Minimum Qualifications

  • Bachelor’s Degree or the equivalent combination of education, training and experience from which comparable skills can be acquired
  • 1+ years of relevant work experience
  • Experience designing complex software solutions
  • Broad and deep programming experience in Java, Python, Go, or other comparable languages 
  • Experience with SQL and data modeling 
  • Experience with non-relational databases
  • Experience with source control systems like Git or Subversion 
  • Experience implementing application security, software design patterns, and software development lifecycle  
  • Experience with Observability tooling such as Datadog or Grafana
  • Familiarity with Agile principles as applied to software development
  • Experience developing and deploying applications via cloud infrastructure (AWS preferred)

Preferred Qualifications

  • Bachelor's Degree in Computer science, information technology, computer engineering, or similar area of study
  • 4+ years of total software engineering experience, with: 
    • 1+ years supporting a production system on a Dev-ops team
    • 1+ years working in a cloud platform like amazon web services, google cloud, or Microsoft azure
  • Experience working with an agile development methodology featuring sprints, points estimation, and daily standups 
  • Familiarity with version control systems and code merging/branching; specific experience with git desirable 
  • Experience with Unix/Linux
  • Experience with test-driven development, continuous integration and deployment, Scrum discipline, or comparable software development practice
  • Deep understanding of AWS architecture
  • Experience working with an agile development methodology featuring sprints, points estimation, and daily standups 
  • Experience in design, data collection, and data analysis
  • Excellent written communications (translating complex topics into clear, accurate, and understandable documentation, such as reports, manuals, and proposals)

Minimum Years of Experience

1

Travel Requirements

10%

Required Level of Education

Bachelor's degree or equivalent experience

Preferred Level of Education

Bachelor's Degree

Submit Resume

Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed