Northrop Grumman Computer Operations Analyst 4 in Fairfax, Virginia
Are you interested in expanding your career through experience and exposure, all while supporting a mission that seeks to ensure the security of our nation and its allies? If so, then Northrop Grumman is the place for you. As a leading global security company, we provide innovative systems, products and solutions to our customers worldwide. We are comprised of diverse professionals that bring different perspectives and ideas, understanding that the more experiences we bring to our work the more innovative we can be. As we continue to build our workforce we look for people that exemplify our core values, leadership characteristics, and approach to innovation.
Northrop Grumman is seeking a Service Continuity professional to lead a team of qualified, diverse individuals managing the Defense Travel System (DTS) operational service. The DTS is an enterprise service providing application and infrastructure services to the DOD. Collaborate with the program teams to keep aware of application and infrastructure changes to ensure compliance with system performance requirements.
The Service Continuity Lead will perform, train and lead a team to provide 24x7 situational awareness of the operational service. This includes the application, infrastructure, connectivity, and supporting services, e.g., related services provided by third parties. Develop and improve processes for the overall system monitoring, control, and alert detection that facilitate the mitigation of operational system before they become user problems.
Roles and Responsibilities
Provide 24x7 operations monitoring solution and triage service anomalies detected or reported by the end user community.
Maintain and enhance program-wide processes for remediation of potential issues before they become problems
Responsible for daily operations of a team or work unit (assignment of work, schedules, day-to-day workflow).
Prepare regular computer operations performance reports for management.
Provide direction to subordinates using established policies and precedents.
Follow and enhance established procedures and timelines to ensure projects complete on schedule
Establish and maintain processes and procedures for direct control and management over application and infrastructure
Create and maintain standard operating procedures and a knowledgebase for triaging the operational service
Performing routine data loads using defined procedures
Maintaining an operational change management process to ensure traceability to all operational service activities
Perform routine startup and shutdown of application and infrastructure components
Escalate issues to other program teams to assist with resolving a service anomaly.
In the case of a severe incident that affects end users, the manager coordinates communications among the staff in order to expedite service restoration.
Provide oversight and management of system incidents and assists the help desk teams by addressing trouble tickets pertaining to operational issues.
Perform routine infrastructure changes and coordinates monthly maintenance activities.
Support integration of external interface changes to ensure the operational service is not interrupted
Communicate with internal and external stakeholders regarding operational service metrics, operational status, and incident management activities
Maintain detailed outage logs for the operational service including service interruptions caused by external interfaces
Monitor system events, availability, and performance including response time, space management, interfaces, etc. for system configurations
9 Years experience with Bachelors; 7 Years with Masters; 4 Years with PhD; an additional 4 years of experience will be considered in lieu of degree.
Demonstrated experience implementing and expanding Splunk to improve system monitoring
Hands on experience monitoring and triaging a J2EE application in a Unix/Linux environment
Ability to provide on-call and after hours support
The ability to obtain and maintain a Secret clearance
ITIL Practitioner Certification Support and Restore (IPSR) and/or Release and Control (IPRC)
Experience using Oracle Enterprise Manager (OEM) to triage system problems
Current ITIL Foundation certification or higher
Experience using Atlassian JIRA and Confluence
Microsoft Office experience
Experience with Security processes and incident handling
Proven ability to train junior analysts to monitor and continuously improve the existing suite of tools
Experience working with internal and external customers
Northrop Grumman is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action Employer, making decisions without regard to race, color, religion, creed, sex, sexual orientation, gender identity, marital status, national origin, age, veteran status, disability, or any other protected class. For our complete EEO/AA and Pay Transparency statement, please visit www.northropgrumman.com/EEO . U.S. Citizenship is required for most positions.