Qualifications- Bachelor's degree in Computer Science, Information Technology, or a related field.
Proven experience as a Nagios and Solarwinds Monitoring
- Expert with a strong experience in IT infrastructure & application monitoring.
(MUST HAVE EXPERIENCE WITH BOTH, SOLAR AND NAGIOS)
In-depth experience of Nagios Core and Nagios XI, including configuration, plugins, and dashboards or In-depth experience of SolarWinds products, including NPM, SAM, DPA, and additional modules.
- Proficient in scripting languages such as Powershell, Bash, Python, or Perl for custom plugindevelopment.
Strong experience of networking concepts, server infrastructure, and application architecture.- Proficiency in scripting languages such as PowerShell for automation and customization of SolarWinds solutions.
Familiarity with Linux/Unix systems and networking concepts.
- Strong problem-solving and troubleshooting skills.
Excellent communication and collaboration abilities.
Preferred Skills:- Certification in Nagios, such as Nagios Certified Professional (NCP) or equivalent.
Experience with other monitoring tools and technologies.
- Knowledge of cloud platforms (AWS, Azure, GCP) and containerization technologies (Docker,Kubernetes)
Job Description
Responsibilities:
- Implementation and Configuration:
o Design, deploy, and configure the monitoring solutions based on organizational requirements.
o Customize configurations to monitor a diverse range of infrastructure components, applications, and services.
o Collaborate with cross-functional teams to integrate monitoring solutions seamlessly into existing workflows.
- Monitoring Infrastructure Management:
o Maintain and optimize infrastructure for functionality, stability, scalability and efficiency.
o Ensure the availability and reliability of monitoring services, including high-availability setups and disaster recovery planning.
- Alerting and Notification Setup:
o Develop and implement effective alerting and notification strategies to promptly identify and address issues.
o Fine-tune alert thresholds and escalation procedures to minimize false positives and ensure timely response.
- Performance Analysis and Tuning:
o Conduct regular performance analysis of monitored systems and applications.
o Implement tuning measures to enhance monitoring efficiency and reduce resource utilization.
- Incident Response and Troubleshooting:
o Provide rapid response to incidents detected by monitoring and collaborate with relevant teams to resolve issues promptly.
o Perform root cause analysis for incidents and contribute to post-incident reports.
o Maintain documentation for monitoring systems, including configurations, processes, and best practices.
o Provide training and guidance to team members on the use of monitoring tools and techniques.
o Ensure adherence to relevant standards, policies, and regulations related to monitoring activities.
- Data Analysis and Interpretation
o Collect, analyze, and interpret data from monitoring systems to identify performance issues, bottlenecks, or anomalies.
o Develop metrics and key performance indicators (KPIs) to track and measure the effectiveness of monitored processes.
- Generate reports and dashboards to communicate insights and trends to stakeholders.
Job Type: Full-time
Benefits:
Work from home
Schedule:Supplemental pay types: Overtime pay
Ability to commute/relocate:
- Alabang: Reliably commute or planning to relocate before starting work (Preferred)
Education:
Experience:
* Nagios and Solarwinds Monitoring: 1 year (Preferred)