Site Reliability Engineer SRE Job at SOMERSET STAFFING, Washington DC

  • SOMERSET STAFFING
  • Washington DC

Job Description

Randstad is seeking a Site Reliability Engineer for a high-impact role with a premier client based in Washington, DC. In this position, you will bridge the gap between development and operations by applying a software engineering mindset to system administration and infrastructure. You will be responsible for ensuring the scalability, performance, and high availability of cloud-based services across AWS and Azure environments. By leveraging Infrastructure-as-Code, advanced observability with Dynatrace, and SRE principles such as error budgets and SLOs, you will drive operational excellence and lead incident response efforts for mission-critical applications.

Key Responsibilities
  • Deployment & Automation: Architect and manage CI/CD pipelines (GitHub Actions, AWS CodePipeline) and automate global infrastructure using Terraform, CloudFormation, or CDK.
  • Performance & Capacity: Drive cost-optimization initiatives, manage auto-scaling thresholds, and execute resiliency/performance testing to ensure system durability.
  • Incident Management: Act as a primary on-call responder using ITIL frameworks and ServiceNow; develop Root Cause Analysis (RCA) documentation and maintain knowledge bases.
  • Observability & Monitoring: Implement distributed tracing and optimize monitoring via Dynatrace and Kibana to create advanced dashboards and anomaly detection.
  • Reliability Engineering: Define and monitor SLIs and SLOs while managing error budgets to balance feature velocity with system stability.
  • Security & Compliance: Oversee service accounts, manage digital certificates, and execute rapid remediation for security incidents.

Qualifications
  • Education: Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • Experience: 2 to 4 years of professional experience in SRE, DevOps, or Infrastructure roles.
  • Cloud Proficiency: Practical, hands-on experience with both AWS and Azure platforms.
  • Technical Skills: Mid-level proficiency in Python (or similar scripting languages) and configuration management tools like Ansible.
  • Containerization: Solid understanding of Docker and orchestration via Kubernetes or ECS.
  • Infrastructure Fundamentals: Strong knowledge of Linux systems, networking protocols, and both relational and NoSQL database architectures.
  • Soft Skills: Excellent written and verbal communication skills with the ability to manage competing priorities independently.
  • Flexibility: Ability to participate in a production on-call rotation, including work outside standard business hours.

This is a high-priority requisition. This is a proactive requisition.

Background Check: No

Drug Screen: No
