Lead Site Reliability Engineer
O'Fallon, MO
Employer: Mastercard
Category: Information Technology
Job Type: Full Time
Description
Our Purpose

We work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team, one that makes better decisions, drives innovation and delivers better business results.

Title and Summary

Lead Site Reliability Engineer

The Enterprise Data Accessibility BizOps team is looking for a Site Reliability Engineer who can help solve problems, build CI/CD pipelines and lead Mastercard in automation and best practices. A Site Reliability Engineer (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the Enterprise Data Accessibility area, you will have the opportunity to manage and enable cloud operations, pipelines, services and infrastructures, as well as API support, monitoring and alerting, and user behavior. You'll also have the opportunity to manage and enable PowerBI readiness and support, as well as Tableau and Enterprise Reporting. You'll need to ensure automation, pipelines, services and infrastructures are reliable, fault-tolerant, efficiently scalable and cost-effective.

Mission

The role of business operations is to be the production readiness steward for the platform. This is accomplished by closely partnering with developers to design, build, implement, and support technology services.
A business operations engineer will ensure operational criteria like system availability, capacity, performance, monitoring, self-healing, and deployment automation are implemented throughout the delivery process. Business Operations plays a key role in leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change and standards throughout the development, quality, release, and product organizations. We accomplish this transformation through supporting daily operations with a hyper focus on triage and then root cause by understanding the business impact of our products.

The goal of every biz ops team is to shift left to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience, and increase the overall value of supported applications. Biz Ops teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. A biz ops focus is also on streamlining and standardizing traditional application specific support activities and centralizing points of interaction for both internal and external partners by communicating effectively with all key stakeholders.

Ultimately, the role of biz ops is to align Product and Customer Focused priorities with Operational needs. We regularly review our run state not only from an internal perspective, but also understanding and providing the feedback loop to our development partners on how we can improve the customer experience of our applications.

What you'll do
• Plan, manage, and oversee all aspects of a Production Environment for Enterprise Data Accessibility.
• Define strategies for Application Performance Monitoring, Unit Cost and Chaos Engineering aspects.
• Find ways for Continuous Optimizations in a Production Environment.
• Ability to understand MTTR, SLO, and SLI definitions and apply them to services.
• Respond to incidents, improve the platform based on feedback, and measure the reduction of incidents over time.
• Ensure reliable, fault-tolerant, efficiently scalable and cost-effective data, services and infrastructures.
• Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
• Practice sustainable incident response and blameless postmortems.
• Isolate problems between hardware and software, working with the appropriate team(s) and vendors until a resolution is reached.
• Perform ad hoc requests from users, such as data research and research of process issues.
• Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation and refinement.
• Analyze ITSM activities of the platform and provide a feedback loop to development teams on operational gaps or resiliency concerns.
• Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
• Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
• Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
• Take a holistic approach to problem solving by connecting the dots during a production event across the various technology stacks that make up the platform, to optimize mean time to recover.
• Work with a global team spread across tech hubs in multiple geographies and time zones.
• Share knowledge and explain processes and procedures to others.
What experience you need
• Bachelor's degree in computer science, software engineering, or a similar field.
• Experience in cloud technologies and operations.
• Experience supporting APIs and cloud technologies.
• Experience in monitoring/alerting tools such as Splunk and Dynatrace.
• Experience with performing data analysis, data observability, data ingestion and data integration.
• 5+ years of DevOps, SRE, or general systems engineering experience.
• 5+ years of experience in running production systems.
• 2+ years of hands-on experience in industry standard CI/CD tools like Git/BitBucket, Jenkins, Maven, Artifactory, and Chef.
• Experience architecting and implementing data governance processes and tooling (such as data catalogs, lineage tools, role-based access control, PII handling).
• Strong coding ability in Python or other languages like Java, C#, Golang, C, C++, Perl or Ruby, and a solid grasp of SQL fundamentals.
• Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
• Ability to help debug and optimize code and automate routine tasks.
• Ability to support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.
• Interest in designing, analyzing and troubleshooting large-scale distributed systems.
• Appetite for change and pushing the boundaries of what can be done with automation.
• Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
• Experience designing and implementing an effective and efficient CI/CD flow that gets code from dev to prod with high quality and minimal manual effort is desired.
• Good handle on the Change Management and Release Management aspects of software.

What could set you apart
• Strong Big Data, Oracle and SQL Server experience.
• SQL tuning experience.
• Strong PowerBI experience.
• Strong data observability experience.
• Operations experience in supporting highly scalable systems.
• Ability to operate in a 24x7 environment encompassing global time zones.
• Self-motivated; creatively solves software problems and effectively keeps the lights on for production systems.

Mastercard is an inclusive equal opportunity employer that considers applicants without regard to gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law. In the US or Canada, if you require accommodations or assistance to complete the online application process or during the recruitment process, please contact reasonable_accommodation@mastercard.com and identify the type of accommodation or assistance you are requesting. Do not include any medical or health information in this email. The Reasonable Accommodations team will respond to your email promptly.

Corporate Security Responsibility

All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must: