Site Reliability Cloud Engineer

April 08, 2024
Offered Salary:Negotiable
Working address:N/A
Contract Type:Other
Working Time:Full time
Working type:N/A
Ref info:N/A

Your Role and Responsibilities

We are looking for a dynamic Site Reliability Engineer to join our Cloud IaaS Operations Team in Austin, TX: someone who is responsive to market needs and can deliver value to our clients in a fast-changing cloud landscape. An SRE spends 50% of their time on toil and 50% on engineering projects. The role requires full-stack systems thinking and coding skills, with a data-driven focus on app/service availability that draws on AI, including machine learning. The SRE team is dedicated to ensuring that IBM Cloud is at the forefront of cloud technology, from data center design, storage and network architecture, and compute clusters to flexible infrastructure services. We operate IBM's cloud platform and are building IBM's next-generation cloud platform and VMware solutions to deliver performance and predictability for our customers' most demanding workloads, at global scale and with industry-leading efficiency, resiliency, and security. It is an exciting time, and as a team we are driven by this incredible opportunity to thrill our clients.

Primary Roles & Responsibilities:

In this Site Reliability Engineer role, you will work closely with several data centers, the entire Cloud organization, and IBM vendors to support, maintain, and operationally improve the IBM Cloud infrastructure. You will focus on the following key responsibilities:

  • Monitor the health of production and test systems
  • Respond promptly to production issues and alerts
  • Execute changes in the production environment through automation and AI
  • Partner with other SRE teams and program managers to deliver mission-critical services to the market
  • Support development of new and existing capabilities for our compute, storage, and network infrastructure services
  • Implement and automate infrastructure solutions that support IBM Cloud products and infrastructure
  • Support the compliance and security integrity of the environment
  • Automate health monitoring of the production and test systems
  • Automate return to service procedures for Cloud Service delivery
  • Support the compliance and security integrity of the environment through your work
  • Partner with other teams, functional managers, and program managers to deliver mission-critical services to the market
  • Create Power BI dashboards on historical and predictive data for client use cases; contribute to designing and implementing the extraction of key entities from millions of unstructured files using Python NLP techniques and Apache Spark
  • Expertise in data interpretation and visualization
  • Define problems and opportunities in a complex business area
  • Develop advanced analytics products
  • Create and develop end-to-end data driven solutions to support and monitor the health of production and test systems
  • Extract data from multiple varied sources and integrate it for analytics and application development
  • Experience with machine learning engineering to develop self-running AI software to automate predictive models
  • Experience with designing machine learning systems and algorithms to generate accurate predictions.
  • Working knowledge of ServiceNow, JIRA, Confluence, and GitHub
  • Working knowledge of container technologies: Kubernetes (preferred), Docker, etc.
  • Hands-on knowledge of log aggregation software such as Splunk or ELK
  • Must have the ability to perform debugging and problem analysis by examining logs and running Unix commands
  • Work with Engineering to:
      • Provide an initial assessment of, and possible workarounds for, production issues
      • Troubleshoot and resolve production issues
  • Work with Support and Development teams to:
      • Identify and resolve issues
      • Discuss and plan integration tasks
  • Provide technical escalation support for other Infrastructure Operations teams
    Introduction

    Working in IBM Cloud gives you the platform to learn, develop and utilize your skills every day by working on the latest cloud-related technology products and services. You'll be working in an environment where we understand that we thrive best when we play to our strengths. That's why developing our people is key to our success, and the door is always open for those ready to advance their career.

    Curiosity and courageous thinking are both vital when working in IBM Cloud, as we continue our dedication to ensuring that we are at the forefront of cloud technology. Our renowned legacy means we are leading the way in everything from analytics and security through to unmatched hardware & software designs. We provide our clients with a full end-to-end transformation as we build IBM's next-generation cloud platform, which is focused on delivering performance and predictability at a global scale.

    IBM's product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.

    Required Technical and Professional Expertise

  • 8+ years of overall industry experience, including at least 6 years of experience in machine learning
  • Technology expertise in solutioning with Hadoop, Hive, Spark/PySpark, SQL, and Oozie, along with data modelling in Hive
  • Proven ability in solutioning covering data ingestion, data cleansing, ETL, data mart creation and exposing data for consumers
  • Scope and deliver solutions with the ability to design solutions independently based on high-level architecture.
  • Data expertise to manipulate and integrate big data, different data types, and other structured databases; Python skills for data handling
    Preferred Technical and Professional Experience

  • Maintain up-to-date technical knowledge by attending educational workshops and reviewing publications
  • 6+ years of experience in virtualization environments such as AWS, SoftLayer, Xen, or VMware
  • Working knowledge & experience with Databases/Storage/Networking in the Cloud
  • Experience with VMware NSX, vRealize Operations Manager, vRealize Network Insight, vSAN
  • Experience in maintaining cloud-based solutions with VMware vCloud Director
  • Experience with replication/failover using Zerto Platform, VMware vCloud Availability or Veeam Cloud Connect
    Required Education

    High School Diploma/GED

    Preferred Education

    Bachelor's Degree

    About Business Unit

    IBM Systems helps IT leaders think differently about their infrastructure. IBM servers and storage are no longer inanimate - they can understand, reason, and learn so our clients can innovate while avoiding IT issues. Our systems power the world's most important industries and our clients are the architects of the future. Join us to help build our leading-edge technology portfolio designed for cognitive business and optimized for cloud computing.

    Wonder if IBM is the one for you?

    In a world where technology never stands still, we understand that dedication to our clients' success, innovation that matters, and trust and personal responsibility in all our relationships live in what we do as IBMers as we strive to be the catalyst that makes the world work better.

    Being an IBMer means you'll be able to learn and develop yourself and your career. You'll be encouraged to be courageous and experiment every day, all whilst having continuous trust and support in an environment where everyone can thrive, whatever their personal or professional background.

    Our IBMers are growth-minded, always staying curious, open to feedback, and learning new information and skills to constantly transform themselves and our company. They are trusted to provide ongoing feedback to help other IBMers grow, and to collaborate with colleagues in a team-focused way that includes different perspectives and drives exceptional outcomes for our customers. The courage our IBMers have to make critical decisions every day is essential to IBM becoming the catalyst for progress: they embrace challenges with the resources they have to hand and a can-do attitude, and always strive for an outcome-focused approach in everything that they do.

    Are you ready to be an IBMer?

    About IBM

    IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.

    Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we're also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business.

    At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.

    Other Relevant Job Details

    IBM offers a competitive and comprehensive benefits program. Eligible employees may have access to:

  • Healthcare benefits including medical & prescription drug coverage, dental, vision, and mental health & well-being
  • Financial programs such as 401(k), the IBM Employee Stock Purchase Plan, financial counseling, life insurance, short & long-term disability coverage, and opportunities for performance-based salary incentive programs
  • Generous paid time off including 12 holidays, minimum 56 hours sick time, 120 hours vacation, 12 weeks parental bonding leave in accordance with IBM Policy, and other Paid Care Leave programs. IBM also offers paid family leave benefits to eligible employees where required by applicable law
  • Training and educational resources on our personalized, AI-driven learning platform where IBMers can grow skills and obtain industry-recognized certifications to achieve their career goals
  • Diverse and inclusive employee resource groups, giving & volunteer opportunities, and discounts on retail products, services & experiences

    The compensation range and benefits for this position are based on a full-time schedule for a full calendar year. The salary will vary depending on your job-related skills, experience and location. Pay increment and frequency of pay will be in accordance with employment classification and applicable laws. For part time roles, your compensation and benefits will be adjusted to reflect your hours. Benefits may be pro-rated for those who start working during the calendar year.

    This position was posted on the date cited in the key job details section and is anticipated to remain posted for 21 days from this date or less if not needed to fill the role. We consider qualified applicants with criminal histories, consistent with applicable law.

    Being You @ IBM

    IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.

    More Information on IBM

    IBM operates in the Big Data Analytics industry. The company is located in Armonk, NY, Southbury, CT, New York, NY, Philadelphia, PA, Washington, DC, Durham, NC, Tampa, FL, Smyrna, GA, Huntsville, AL, Chicago, IL, Dallas, TX and San Francisco, CA. It has 533854 total employees. It offers perks and benefits such as Flexible Spending Account (FSA), Disability insurance, Dental insurance, Vision insurance, Health insurance and Life insurance.
