Lead Site Reliability Engineer

Ford Motor Company
April 16, 2024
Offerd Salary:Negotiation
Working address:N/A
Contract Type:Other
Working Time:Negotigation
Working type:N/A
Ref info:N/A

Job Description

We are the movers of the world and the makers of the future. We get up daily, roll up our sleeves, and build a better world together. At Ford, we believe freedom of movement drives human progress and we're all a part of something bigger than ourselves. You will have the opportunity to accelerate your career potential as you help us define tomorrow's transportation.

As a key member of our Enterprise Technology Group, you'll play a critical part in crafting the future of mobility. If you're looking for the chance to bring to bear advanced technology to redefine the transportation landscape, enhance the customer experience, and improve people's lives, this is your opportunity. Join us and challenge your IT expertise and analytical skills to help build vehicles as inquisitive as you are.

We are looking for a Site Reliability Engineer (SRE) to join our Ford Technology Team who can combine software engineering and systems engineering to ensure a software system is available, scalable, and maintainable. Your business domain knowledge to support our critical eCommerce program.


What you'll be able to do:

As SRE managing a large, distributed application built on microservices, Spring Boot, and Google Cloud may include:

  • Focus on the reliability and maintainability of existing and new systems.
  • Run the production environment by monitoring availability and taking a holistic view of system health.
  • Developing, improving, and operating the deployment and orchestration of a complex distributed system
  • Improve reliability, quality, and time-to-market of our suite of software solutions.
  • Measure and optimize system performance, to push our capabilities forward, getting ahead of customer needs, and innovate to continually improve.
  • Provide primary operational and engineering Support for multiple large, distributed software applications.
  • Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation.
  • Collaborating with development teams to design, build, and operate scalable and resilient software systems.
  • Automating deployment, monitoring, and incident response processes
  • Performing root cause analysis of production incidents and implementing preventive measures
  • Conducting performance analysis and optimization of the system
  • Ensuring compliance with security and regulatory standards
  • Implementing and maintaining disaster recovery processes
  • Providing technical guidance and mentorship to other team members
  • Participating in an on-call rotation for incident response and support
  • Qualifications

    The minimum requirements we seek:

  • Bachelor's degree in Computer Science, Computer Engineering, or a related field
  • 5+ years' experience with JAVA, J2EE, NoSQL/SQL Datastore, Spring Boot, GCP/AWS/Azure & Docker/K8 in developing multi-tier applications.
  • 4+ years of experience with any APM and other monitoring tools such as Dynatrace, New Relic, ELK, Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog, and PagerDuty.
  • Experience with infrastructure as code (IaC) using tools like Terraform, Google Cloud Deployment Manager, or similar.
  • Our preferred qualifications:

  • Advanced degree in Computer Science, Computer Engineering, or a related field
  • Strong background in software development and systems administration, as well as excellent problem-solving and communication skills.
  • Experience with RESTful APIs and microservices platforms
  • Working knowledge of the TCP/IP stack, internet routing and load balancing
  • Experience with product & development teams to establish error budgets by identifying the right SLOs (Service level objective), SLIs (Service level indicators), and KPIs (Key performance indicators) and effectively drive the use of the budget to ensure maximum domain availability/uptime.
  • Thorough understanding of the software development cycle and agile programming environment
  • Regularly review key site technical metrics such as transaction errors, logging, response times, caching strategies, conversion/bounce rates, capacity & resource utilization.
  • Proactively identify stability risks & work with engineering leadership to establish appropriate mitigation plans
  • Experience in solving complex architecture/design & business problems, working to simplify, optimize, remove bottlenecks, etc.
  • Architect, design & develop automation to reduce toil, and improve recoverability, availability, latency & scalability of supported applications with an understanding of MTTD (Mean Time to Detection) & MTTR (Mean Time to Resolution)
  • Maintain a knowledge repository that includes Standard operating procedures, Release checklists, and Runbooks for incident recovery.
  • Debug production issues across services and levels of the stack.
  • What you'll receive in return:

    As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builderor all the above? No matter what you choose, we offer a work life that works for you, including Immediate medical, dental, and prescription drug coverage, generous PTO, retirement, and savings plans, incentive compensation, tuition assistance, a vehicle discount program, and much more

    For information on Ford's salary and benefits, please visit: https: // corporate.ford.com/content/dam/corporate/us/en- us/documents/careers/2024-benefits-and-comp-GSR-sal-plan-2.pdf

    Candidates with Ford Motor Company positions must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire. Visa sponsorship is available for this position.

    We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status, or protected veteran status. In the United States, if you need a reasonable accommodation for the online application process due to a disability, please call 1-888-336-0660.

    About Us

    At Ford Motor Company, we believe freedom of movement drives human progress. With our incredible plans for the future of mobility, we have a wide variety of opportunities for you to accelerate your career and help us define tomorrow's transportation.

    About the Team

    We believe that freedom of movement drives human progress. Ford Information Technology (IT) is shaping the future of mobility by redefining the transportation landscape, enhancing the customer experience and improving people's lives. Join the Ford family as we change the way the world moves.

    More Information on Ford Motor Company

    Ford Motor Company operates in the Automotive industry. The company is located in Dearborn, MI and Palo Alto, CA. Ford Motor Company was founded in 1903. It has 175633 total employees. It offers perks and benefits such as Flexible Spending Account (FSA), Disability insurance, Dental insurance, Vision insurance, Health insurance and Life insurance. To see all 165 open jobs at Ford Motor Company, click here.

    Read Full Job Description

    From this employer

    Recent blogs

    Recent news