Site Reliability Engineer Job at LTIMindtree, Dallas, TX

L0JNb09oUGxnN2dPZ1g1ZS8xdk1JM1Z6UFE9PQ==
  • LTIMindtree
  • Dallas, TX

Job Description

About Us:

LTIMindtree is a global technology consulting and digital solutions company that enables enterprises across industries to reimagine business models, accelerate innovation, and maximize growth by harnessing digital technologies. As a digital transformation partner to more than 700+ clients, LTIMindtree brings extensive domain and technology expertise to help drive superior competitive differentiation, customer experiences, and business outcomes in a converging world. Powered by nearly 90,000 talented and entrepreneurial professionals across more than 30 countries, LTIMindtree — a Larsen & Toubro Group company — combines the industry-acclaimed strengths of erstwhile Larsen and Toubro Infotech and Mindtree in solving the most complex business challenges and delivering transformation at scale. For more information, please visit .

Position: SRE with strong Python Automation

Location: Dallas, TX

Duration: FTE.

Job Description:

We are looking for a highly skilled Automation Engineer with a strong systems engineering background to build scalable, resilient, and intelligent automation solutions. This role demands someone who thrives in aggressive environments, embraces complex challenges, and can operate effectively in uncertain situations. Someone with a automation-first mindset to drive efficiency, reduce manual toil, and enhance operational excellence using modern automation solutions will be a great fit.

As part of a mission-critical team, you will work on automating infrastructure, integrating tools via APIs, improving observability, and implementing AIOps-driven solutions. If you’re passionate about problem-solving, AI/ML in operations, and optimizing large-scale cloud environments, this role is for you.

Key Responsibilities

  • Develop Python-based automation solutions to streamline on-prem and cloud infrastructure management on GCP and Kubernetes.
  • Continuously identify and implement the opportunities to enhance the operational excellence.
  • Build proactive and innovative solutions that can scale.
  • Implement and manage configuration automation using Ansible (desirable).
  • Integrate various tools and services via APIs and client libraries, enabling seamless interoperability across systems.
  • Enhance deployment reliability by implementing automated chaos strategies, failover mechanisms, and self-healing infrastructure.
  • Develop proactive monitoring and alerting solutions using tools like Splunk, GCP Operations Suite, Grafana, and Prometheus.
  • Perform deep root cause analysis (RCA), incident management for complex system failures and develop automation to prevent recurrence.
  • Work on system resilience and performance tuning, ensuring mission-critical applications run efficiently under high loads.
  • Apply AI/ML techniques to automation workflows, enhancing anomaly detection, predictive scaling, and intelligent alerting.
  • Identify and develop AIOps opportunities, reducing operational overhead through intelligent automation.
  • Experiment with machine learning models to optimize log analysis, monitoring insights, and failure predictions.

Required Skills & Experience

  • Strong background in Systems Engineering with a focus on automation and reliability.
  • Proficiency in Python (intermediate to expert level) for developing automation and integrations.
  • Hands-on expertise with Kubernetes and cloud platforms (GCP or any major cloud).
  • Experience integrating various tools and platforms via APIs and client libraries.
  • Deep understanding of monitoring and alerting using Splunk, GCP Operations Suite, Grafana, and Prometheus.
  • Ability to work in aggressive, high-stakes environments where reliability and uptime are critical.
  • Strong problem-solving skills, capable of navigating uncertainty and handling complex challenges.
  • Experience with Ansible for infrastructure automation.
  • Prior experience working in mission-critical teams handling large-scale, high-availability systems is a plus.
  • Enthusiasm for AI/ML and AIOps, with a desire to apply it in automation and operations.

Benefits/perks listed below may vary depending on the nature of your employment with LTIMindtree (“LTIM”):

Benefits and Perks:

  • Medical Plan Covering Medical, Dental, Vision
  • Term and Long-Term Disability Coverage
  • Plan with Company match.
  • Insurance
  • Time, Sick Leave, Paid Holidays
  • Paternity and Maternity Leave

The range displayed on each job posting reflects the minimum and maximum salary target for the position across all US locations. Within the range, individual pay is determined by work location and job level and additional factors including job-related skills, experience, and relevant education or training. Depending on the position offered, other forms of compensation may be provided as part of overall compensation like an annual performance-based bonus, sales incentive pay and other forms of bonus or variable compensation.

Disclaimer: The compensation and benefits information provided herein is accurate as of the date of this posting.

LTIMindtree is an equal opportunity employer that is committed to diversity in the workplace. Our employment decisions are made without regard to race, color, creed, religion, sex (including pregnancy, childbirth or related medical conditions), gender identity or expression, national origin, ancestry, age, family-care status, veteran status, marital status, civil union status, domestic partnership status, military service, handicap or disability or history of handicap or disability, genetic information, atypical hereditary cellular or blood trait, union affiliation, affectional or sexual orientation or preference, or any other characteristic protected by applicable federal, state, or local law, except where such considerations are bona fide occupational qualifications permitted by law.

Safe return to office:

  • In order to comply with LTIMindtree’ s company COVID-19 vaccine mandate, candidates must be able to provide proof of full vaccination against COVID-19 before or by the date of hire. Alternatively, one may submit a request for reasonable accommodation from LTIMindtree’s COVID-19 vaccination mandate for approval, in accordance with applicable state and federal law, by the date of hire. Any request is subject to review through LTIMindtree’s applicable processes.

Job Tags

Holiday work, Local area,

Similar Jobs

UNIQLO

Sr. Visual Merchandising Graphic Designer (Temp - Perm) Job at UNIQLO

Company Overview: Apparel that comes from the Japanese values of simplicity, quality and longevity. Designed to be of the time and for the time, LifeWear is made with such modern elegance that it becomes the building blocks of each individuals style. A perfect shirt ...

Aimic Inc

Pharmacist Job at Aimic Inc

Job Summary: The Pharmacist will dispense drugs prescribed by physicians and other health practitioners and provide information to patients about medications and their use.They may advise physicians and other health practitioners on the selection, dosage, interactions...

Proclinical Staffing

Warehouse Clerk Job at Proclinical Staffing

 ...May require wearing personal protective equipment (PPE). The Warehouse Clerk's responsibilities will be: Assist in loading and unloading trucks while maintaining an orderly work area. Conduct operations safely, complying with relevant standards and... 

Michael Andrews Audio Visual Services

Delivery Truck Driver Job at Michael Andrews Audio Visual Services

Michael Andrews Audio Visual Service is seeking an experienced Delivery Truck Drivers to add to our team. Drivers will also be responsible for assisting the warehouse operations with the loading and unloading of equipment from vehicles. Fleet of vehicles include 20 and...

Fisher Management Partners

Finance Manager Job at Fisher Management Partners

 ...Fisher Management Partners is dedicated to helping clients accelerate growth and drive results that matter.We serve the middle market...  ...service lines include: strategy execution, supply chain solutions, finance solutions, customer experience, technology solutions, and...