Site Reliability Engineer Architect Job at NeerInfo Solutions, Dallas, TX

  • NeerInfo Solutions
  • Dallas, TX

Job Description

The client is seeking an SRE Architect. The position is primarily responsible for leading maintenance and support of custom IT applications. The selected candidate should have strong technical knowledge, analytical ability, good communication skills, and previous experience managing support and management projects. The candidate should also have experience leading a team and a good understanding of incident and problem management.

Required Qualification:

  • Bachelor’s degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
  • At least 10 years of Information Technology experience.
  • SRE Mindset in Production support: Proactive issue identification using observability tools.
  • Skilled in using different monitoring & observability tools to track system performance
  • Incident commander: Ability to diagnose complex issues and actively drive incident calls, working with technical and product SMEs and Tier 2 SREs.
  • Experience in Splunk (including Splunk APM and Splunk O11y) and AppDynamics
  • Experience in DB, Network, Linux / Unix, Kubernetes
  • Experience in APM, NMON, and Wireshark usage and analysis

Preferred Qualification:

Production support expertise with SRE observability experience:

  • Proactive issue identification using observability tools.
  • Skills in using different monitoring & observability tools to track system performance
  • Production support activities, including proactive identification of issues using observability tools and correlating inputs from various dashboards and tools to drive resolution
  • Experience in swiftly identifying probable failure points by analyzing multiple inputs: logs, observability dashboards, and recent application, infrastructure, and network changes
  • Basic troubleshooting on every layer of the tech stack (application, database, infrastructure including container platforms, and network)
  • Experience in setting up observability dashboards based on Splunk logs

Communication:

  • Excellent communicator, expected to actively lead and triage proactively identified issues and incidents on calls where VPs and SVPs are present.
  • Leadership in triage calls: direct the teams on actions to be taken during the call

Automation:

  • Experience in toil identification and automation

Technical expertise:

  • Analysis of issues via Splunk (including Splunk APM and Splunk O11y), AppDynamics, Grafana, RedMetrics, ThousandEyes
  • Debugging of issues in VMs, Load balancers, Firewalls, API Gateways, DB, Network, Linux / Unix
  • Debugging of issues in Containerization, Docker, Kubernetes, AWS, PCF, Azure
  • Analysis of issues using APM, NMON, and Wireshark
  • Database performance monitoring and analysis
  • Experience in UEM and synthetic monitoring setup
  • Experience in heap dump analysis, memory leak analysis and resource optimization
