Cloud Observability Engineer in Iselin, New Jersey
Posted 03/22/23
THE TEAM YOU WILL BE JOINING:
Fortune 100 Financial Services Company
100-year history of dedication to customer satisfaction, success and growth
Tremendous growth and new business strategy leading to the need for new talent
Significant investments in cutting-edge technology
WHAT THEY OFFER YOU:
Culture: Excellent work environment that fosters collaboration
Growth: Ability to make an impact on the direction of the organization
Opportunity: Gain hands-on experience working with cutting-edge technology
Stability: The company has recently reported record profits
WHY THIS ROLE IS IMPORTANT:
Establish and enhance a data-driven IT observability platform for full-fidelity, real-time monitoring of infrastructure, applications, microservices, containers, and user interfaces
Detect errors, latency, anomalies, performance degradation, and other problems before users and customers are impacted
Map problems and errors in applications and microservices to the owners of those applications and services
Establish a continuous, automated process that uses application logs, traces, and metrics to detect and report service performance issues
Partner with other cross-functional teams to identify issues and drive them to resolution using Splunk and other monitoring platforms
Work closely with UNIX, Linux, and Windows server administration teams to diagnose and resolve configuration issues
Provide training and user support for the Splunk platform and other monitoring components
Use collaboration tools to organize and document work and to share knowledge
THE BACKGROUND THAT FITS:
Required Skills:
5+ years with Splunk Enterprise and Splunk Cloud
3+ years in UNIX / Windows Engineering
2+ years in:
SignalFx
OpenTelemetry
Scripting (Python, shell, Ansible)
Integration of web technologies (SDKs, REST, JSON, XML, etc.)
Designing/supporting platforms with multi-site and/or highly available designs
Previous experience with Cloud Application Performance Monitoring & Observability
OpenTelemetry implementation or consulting experience
Previous experience in metric-based onboarding along with Splunk Observability
Experience with the AWS platform, including CloudWatch, X-Ray, S3, and CloudTrail
Proven hands-on experience supporting the Splunk Operator on container platforms such as Kubernetes or OpenShift
Experience writing regex to perform field extractions at search time
Ability to write Splunk Enterprise / Splunk Cloud queries to create complex Splunk dashboards that detect and illustrate capacity trends, constraints, and risks
Experience with regex, shell scripting, Ansible, Python, REST or other API calls, Git, JSON, XML, and web technologies