Sign In

Blog

Latest News

Monitoring Automation Engineer

Hybrid
Jersey City, NJ, Plano, TX
Posted 6 days ago

Monitoring Automation Engineer

Posted: June 19, 2025 Job Type: Contract Industry: Engineering

We are actively seeking a highly skilled Monitoring Automation Engineer to join our team for a 12-month contract, with a strong possibility of extension. This pivotal role is driven by new funding, as we pivot from a traditional Splunk SME search to focusing on an engineer who can expertly configure and automate monitoring solutions using code, with a solid background in Splunk.


Location & Start Date:

  • Targeted Core Locations: Plano, TX, or Jersey City, NJ.
  • Remote Option: This is a hybrid role, requiring 3 days per week in the office.
  • Targeted Start Date: May/June

What’s the Job?

As a Monitoring Automation Engineer, you will play a critical role in designing, implementing, and automating the monitoring pipelines for applications within our Global Markets division. You’ll collaborate closely with DevOps, Site Reliability Engineering (SRE), and development teams to ensure our monitoring tools deliver actionable insights into system performance and reliability. Your key responsibilities will include:

  • Monitoring Pipeline Design & Implementation: Designing and implementing robust monitoring pipelines using industry-leading tools such as Splunk, Dynatrace, and OpenTelemetry (OTel).
  • Automated Monitoring Tool Deployment: Automating the deployment and configuration of monitoring tools using Terraform for infrastructure as code, Ansible for configuration management, and Jenkins for CI/CD orchestration.
  • Configuration & Version Control: Managing monitoring configurations and ensuring strict version control using Bitbucket and handling package management with Artifactory.
  • CI/CD Integration: Ensuring seamless integration of monitoring solutions directly into existing CI/CD pipelines, enabling automated validation and alerting.
  • Alerting, Logging & Tracing Solutions: Developing and maintaining comprehensive alerting, logging, and tracing solutions to fully support modern observability best practices.
  • Optimization: Continuously optimizing monitoring configurations for optimal performance, cost efficiency, and scalability across the enterprise.
  • Troubleshooting & Root Cause Analysis: Expertly troubleshooting complex monitoring issues and providing thorough root cause analysis for system incidents to prevent recurrence.
  • Documentation: Meticulously documenting monitoring architectures, automation scripts, and established best practices for clarity and knowledge sharing.
  • Technology Advancement: Staying consistently updated on new and emerging monitoring technologies and proactively advocating for their strategic adoption and continuous improvements.

Requirements:

We’re seeking a highly skilled engineer with a strong foundation in automation and monitoring:

  • Infrastructure as Code (IaC): Proven hands-on experience with Terraform for automating infrastructure deployment.
  • CI/CD Pipelines: Demonstrated proficiency in Jenkins for automation orchestration and robust version control tools like Bitbucket.
  • Configuration Management: Solid experience with Ansible for automated deployments and system configuration.
  • Artifact Management: Practical knowledge of Artifactory for efficient package and artifact management.
  • Monitoring & Observability: Direct experience configuring and managing core monitoring and observability platforms such as Splunk, Dynatrace, and OpenTelemetry (OTel).
  • Scripting & Automation: Strong proficiency in scripting languages, specifically Python and Bash shell, for developing automation scripts and tools.
  • Problem-Solving: Exceptional troubleshooting skills for effectively diagnosing complex monitoring and performance issues across distributed systems.

Nice-to-Have Skills:

While not strictly required, candidates with the following will be highly valued:

  • Knowledge of Prometheus, Grafana, or the ELK Stack (Elasticsearch, Logstash, Kibana).
  • Experience with Kafka for streaming data and Kubernetes for container orchestration.

What to Expect on Day 1:

You’ll hit the ground running as a skilled Monitoring Engineer, ready to design, configure, and maintain monitoring pipelines for critical applications within Global Markets. You’ll immediately begin collaborating with DevOps, SRE, and development teams to ensure our monitoring tools provide actionable insights into system performance and reliability, directly impacting business operations.


Interview Process:

  • A streamlined 1-2 round interview process. We aim to conduct interviews as early as next week (week of April 14th).

Job Features

Job CategoryEngineering, Hybrid, IT

Apply For This Job

A valid phone number is required.