Lead Software Engineer - Site Reliability Engineer

at Capital One Services II, LLC in Wilmington, Delaware, United States

Job Description

114 5th Ave (22114), United States of America, New York, New York

Lead Software Engineer - Site Reliability Engineer (SRE)

Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive, and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who solve real problems and meet real customer needs. We are seeking Reliability Engineers who are passionate about marrying data with emerging technologies. As a Capital One Lead Site Reliability Engineer, you'll have the opportunity to be on the forefront of driving a major transformation within Capital One.

What You'll Do:

Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies

Communicate Service Level Objective concepts to product partners and drive agreement on objectives

Influence the strategic direction of the team, identifying and prioritizing opportunities to improve reliability

Drive resolution of issues on incident calls, providing systematic and logical approaches and prioritizing the work of multiple teams

Drive improvements to the incident resolution process with the introduction of automation

Conduct blameless incident reviews and post-incident analysis, and communicate incident review learnings

Drive implementation of processes or solutions that improve reliability across multiple platforms

Identify gaps in automation and develop strategic plans to drive solutions that reduce toil for the platform teams

Work with other experts to arrive at optimal design and deployment configurations

Establish standards that improve deployment and system reliability for integration pipelines, and recommend approaches for chaos testing a particular system

Identify and create proactive, automated approaches to system reliability and alerting, and identify key performance indicators for a system, including adding, tuning and maintaining alert configurations

Understand business requirements for system reliability and translate them into implementations such as scaling, failover, timeouts and health checks, and work with development teams to test and improve system performance and reliability

Basic Qualifications:

Bachelor's degree

At least 4 years of engineering or development experience (Internship experience does not apply)

At least 2 years of experience with coding or scripting

At least 2 years of experience deploying and supporting Continuous Integration and Continuous Deployment (Jenkins, GitLab, Spinnaker)

At least 2 years of experience with a cloud computing provider (AWS, Microsoft Azure, Google Cloud)

At least 2 years of experience deploying Infrastructure as Code (Terraform, CloudFormation)

At least 2 years of experience in database server infrastructure (RDS, Aurora, DynamoDB, MySQL, Postgres)

At least 2 years of experience in container orchestration services (ECS, Kubernetes)

At least 2 years of experience building and testing software (JMeter, Mockito, JUnit)

At least 2 years of experience in a technical leadership role overseeing strategic projects

Preferred Qualifications:

Experience with Agile software development

Experience in Application Performance Monitoring (APM) and Cloud Observability tools (New Relic, AWS CloudWatch, OpenTelemetry, AppDynamics, Dynatrace, Scout)

2+ years of experience with blameless incident reviews and post-incident responses

2+ years of experience with secure coding practices

2+ years of experience in creating release documentation

2+ years of experience in logging technologies (log4j configuration, Splunk)

2+ years of experience in resilient system architecture patterns (Microservices Architecture, Layered Architecture, Event-Driven Architecture)

At this time, Capital One will not sponsor a new applicant for employment authorization for this position.

The minimum and maximum full-time... For full info follow application link.

Capital One is an equal opportunity employer committed to diversity in the workplace. Capital One promotes a drug-free workplace. 

All qualified applicants will receive consideration for employment without regard to gender, race, color, religion, national origin, sexual orientation, protected veteran status, or disability status.

Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; Newark, New Jersey Ordinance 12-1630; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries. 



Job Posting: 1277235

Posted On: Jun 11, 2024

Updated On: Jun 19, 2024
