Jump to main content.

The Official Careers Website of the City of New York

Search

Site Reliability Engineering Manager

  1. TECHNOLOGY & INNOVATION
Posted on: 02/08/2024
  1. Full-time

Location

BROOKLYN

  1. Exam may be required

Department

APP ENG: DEVOPS

$58,700.00 – $137,500.00

Job Description

This vacancy has now expired.

The Office of Technology and Innovation (OTI/DoITT) oversees all Citywide technology, privacy, cybersecurity, infrastructure, and telecommunications to ensure the security of, and enhance, City operations and service delivery to New York City's residents, businesses, employees, and visitors. As the City's technology and innovation leader, OTI is responsible for operating, maintaining and securing IT infrastructure and systems that touch every aspect of City life from public safety to human services, from education to economic development crossing the full spectrum of governmental operations.

The successful candidate will serve as a Site Reliability Engineering Manager reporting to the Applications Division. Responsibilities include:
- Manage and mentor a team of support engineers focusing on application availability, monitoring, performance, efficiency, change management, and capacity planning for production and non-production systems running nyc.gov, CityShare, and mobile applications;
- Plan and manage upgrade projects for infrastructure, applications, tools, and services;
- Maintain an inventory of all the applications and services provided by the DevOps unit;
- Build and maintain operational tools/services for deployment, monitoring, and knowledge base;
- Perform infrastructure cost analysis and periodically provide recommendations for optimization;
- Measure, optimize, and report on application service performance and availability;
- Track and ensure infrastructure and application patches are applied on time;
- Manage review process for and ensure completeness of Technical and Operational Design Documents;
- Manage remediation of security vulnerabilities identified during pen testing in our production environment;
- Partner with other technical teams within Applications and Infrastructure Management to ensure infrastructure provisioning and application deployments are fully automated;
- Conduct periodic and scheduled reviews of all systems and applications to discover inefficiencies and prepare a plan to remediate or mitigate them;
- Provide project management, sprint planning, and road-mapping support to the DevOps team;
- Collaborate with all the DevOps operational teams to ensure best practices are being followed;
- Assist the Custom Application Development Engineering team, as required;
- Manage the on-call support schedule for the team


Minimum Qualifications

1. A master's degree in computer science from an accredited college or university and three (3) years of progressively more responsible, full-time, satisfactory experience in Information Technology (IT) including applications development, systems development, data communications and networking, database administration, data processing, or user services. At least eighteen (18) months of this experience must have been in an administrative, managerial or executive capacity in the areas of applications development, systems development, data communications and networking, database administration, data processing or in the supervision of staff performing these duties; or

2. A baccalaureate degree from an accredited college or university and four (4) years of progressively more responsible, full-time, satisfactory experience as described in "1" above; or

3. A four-year high school diploma or its educational equivalent, and six (6) years of progressively more responsible, full-time, satisfactory experience as described in "1" above; or

4. A satisfactory combination of education and experience equivalent to "1", "2" or "3" above. However, all candidates must have at least a four-year high school diploma or its educational equivalent and must possess at least three (3) years of experience as described in "1" above, including the eighteen (18) months of administrative, managerial, executive or supervisory experience as described in "1" above.

In the absence of a baccalaureate degree, undergraduate credits may be substituted for a maximum of two (2) years of the required experience in IT on the basis of 30 semester credits for six (6) months of the required experience. Graduate credits in computer science may be substituted for a maximum of one (1) year of the required experience in IT on the basis of 30 graduate semester credits in computer science for one (1) year of the required IT experience. However, undergraduate and/or graduate credits may not be substituted for the eighteen (18) months of experience in an administrative, managerial, executive, or supervisory capacity as described in "1" above.


Preferred Skills

The preferred candidate should possess the following: - 5+ years of satisfactory experience in managing application engineering teams that support real-time systems - 8+ years of development and/or support experience, with at least 3 years of hands-on experience in supporting and optimizing production environments running web applications in the cloud - Strong knowledge of DevSecOps principals and tools - Experience in Azure DevOps or similar tools and CI/CD processes - Experience in architecting, designing, monitoring, and maintaining complex and scalable web applications - Experience in creating and managing projects to improve real-time systems - Experience in infrastructure cost analysis and providing recommendations for optimization - Experience in managing applications and server upgrades and deployments - Strong experience with AWS and/or Azure and knowledge of various cloud services - Knowledge of internet protocols, integration products, and tools - Experience with any relational database such as Oracle, Microsoft SQL Server, or MySQL - Experience with Containerization and Microservices - Experience with any application server such as JBoss, Tomcat, IIS, or Oracle WebLogic server - Proficiency in UNIX shell scripting - Ability to quickly learn new software and technologies - Ability to work independently and collaboratively to resolve client issues without detailed supervision in a fast-paced, dynamic environment - Ability to take ownership of and follow through on client issues until resolution is achieved - Excellent communication skills, both written and oral - Excellent interpersonal, organizational, and problem-solving skills - Ability to work with non-technical end users on complex issues - Ability to communicate professionally and effectively with customers at all levels - Ability to explain technical details in a plain, clear, and simple manner - Ability to absorb and retain information quickly - High level of attention to detail and self-motivation - Ability to take initiative, think creatively, and work independently without significant guidance - Must have positive attitude, be collaborative and possess negotiation skills to work with internal and external teams - Strong passion for technology and willingness to learn new things.
Residency Requirement

City Residency is not required for this position
Additional Information

The City of New York is an inclusive equal opportunity employer committed to recruiting and retaining a diverse workforce and providing a work environment that is free from discrimination and harassment based upon any legally protected status or protected characteristic, including but not limited to an individual's sex, race, color, ethnicity, national origin, age, religion, disability, sexual orientation, veteran status, gender identity, or pregnancy.

Job ID

537953

Title code

1005D

Civil service title

COMPUTER SYSTEMS MANAGER

Title classification

Competitive-1

Business title

Site Reliability Engineering Manager

  1. Experienced (non-manager)

Job level

00

Number of positions

1

Work location

2 Metro Tech

  1. Technology, Data & Innovation

Site Reliability Engineering Manager

Search