Senior Site Reliability Engineer (Application Support)
DTCC
- Location: Hybrid (Jersey City, NJ)
- Employment: Full-time
- Level: Senior Level
About the Role
DTCC is a leading post-trade market infrastructure for the global financial services industry, automating and centralizing financial transactions to mitigate risk and enhance efficiency. As a Senior Site Reliability Engineer, you will ensure the stability and performance of mission-critical applications, driving continuous improvement and operational excellence.
Benefits
- Health Insurance
- Life Insurance
- Retirement Benefits
- Paid Time Off
Perks
- Hybrid Work
Full job details
Are you ready to make an impact at DTCC?
Do you want to work on innovative projects, collaborate with a dynamic and supportive team, and receive investment in your professional development? At DTCC, we are at the forefront of innovation in the financial markets. We're committed to helping our employees grow and succeed. We believe that you have the skills and drive to make a real impact. We foster a thriving internal community and are committed to creating a workplace that looks like the world that we serve.
Pay and Benefits:
- Competitive compensation, including base pay and annual incentive
- Comprehensive health and life insurance and well-being benefits, based on location
- Pension / Retirement benefits
- Paid Time Off and Personal/Family Care, and other leaves of absence when needed to support your physical, financial, and emotional well-being.
- DTCC offers a flexible/hybrid model of 3 days onsite and 2 days remote (onsite Tuesdays, Wednesdays and a third day unique to each team or employee).
The impact you will have in this role:
As a Senior Application Support Engineer (SRE), you will play a critical role in ensuring the stability, reliability, and performance of mission-critical applications at DTCC.
This role goes beyond traditional support, focusing on Site Reliability Engineering principles, proactive system improvement, and operational excellence. You will partner closely with development, infrastructure, and global operations teams to enhance system resilience, reduce operational toil, and drive continuous improvement across the platform.
Your Primary Responsibilities:
- Act as a Lead Application Support Engineer with SRE responsibilities, partnering with engineering and infrastructure teams to improve system reliability, resilience, and observability
- Lead the resolution of critical production incidents, providing clear impact analysis, root cause identification, and preventive actions
- Own and drive incident, problem, and major incident management, including post-incident reviews and continuous improvement
- Proactively identify reliability risks and implement solutions to prevent recurrence and reduce operational toil
- Develop, maintain, and enhance runbooks, knowledge articles, and operational documentation
- Execute and support release, change, and deployment activities, including production releases and vendor upgrades
- Support and participate in Disaster Recovery (DR) testing, execution, and audit readiness
- Drive automation and alert optimization initiatives to improve efficiency and reduce noise
- Embed risk, control, and reliability best practices into day-to-day operations
- Collaborate with global teams to ensure high availability and operational excellence across systems
**NOTE: The Primary Responsibilities of this role are not limited to the details above.**
Qualifications:
- 6+ years of experience in application support, SRE, or production engineering
- Bachelor's degree or equivalent experience preferred
Required Skills
- Strong understanding of SRE principles, including reliability engineering, observability, and incident prevention
- Experience working in Linux and Windows environments, with strong troubleshooting and log analysis skills
- Hands-on experience with monitoring and observability tools (e.g., Splunk, Grafana)
- Working knowledge of SQL for analysis and troubleshooting
- Experience with ITSM tools (e.g., ServiceNow) for incident, problem, and change management
- Familiarity with job scheduling and modern platforms (e.g., Autosys, OpenShift, containers)
- Exposure to mainframe technologies, including job processing, scheduling, and legacy system interactions
- Understanding of AI/ML concepts in production support (e.g., automation, AIOps, anomaly detection, incident reduction)
- Understanding of security fundamentals (certificates, access, credentials)
- Experience supporting AWS-based applications and services
- Strong communication, ownership, and problem-solving skills in high-pressure environments
- Experience working with global, distributed teams
The salary range is indicative for roles at the same level within DTCC across all US locations. Actual salary is determined based on the role, location, individual experience, skills, and other considerations.
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
DTCC proudly supports Flexible Work Arrangements, favoring openness and giving people the freedom to do their jobs well by encouraging diverse opinions and emphasizing teamwork. When you join our team, you’ll have an opportunity to make meaningful contributions at a company that is recognized as a thought leader in both the financial services and technology industries. A DTCC career is more than a good way to earn a living. It’s the chance to make a difference at a company that’s truly one of a kind.
This department serves as a dedicated technology resource for advancing DTCC’s business opportunities and providing industry thought leadership on leveraging new technology. Its goal is to partner internally with IT and our business and regulatory divisions, and externally with clients, regulators, and fintech vendors, to help build new platforms and business models that advance DTCC’s mission to support the financial markets.