Sr. Professional I, Server Engineering


Main Responsibilities

  • Troubleshoot and resolve problems.
  • Respond to availability incidents and provide support for service engineers.
  • Configure monitoring and alerting to alert on symptoms, and take appropriate action.
  • Document every action so your findings turn into repeatable actions, and then into automation.


  • 5+ years in a Linux systems administration role in a large physical and virtual environment (required).
  • 5+ years' experience in Red Hat Linux and Unix administration (must have).
  • 5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), SUSE Linux, or CentOS, or hosted at a cloud provider such as Amazon Web Services (AWS), Google Kubernetes Engine (GKE), or Azure Kubernetes Service (AKS), as well as on-premises Kubernetes.
  • Experience operating within a large, complex server and application environment with more than 500 servers under management.
  • Experience in Red Hat Virtualization (strongly preferred).
  • Expert-level knowledge of Unix/Linux and related services, including Red Hat Enterprise Linux 7.
  • Strong experience building and maintaining high-availability (HA) Linux clusters (Red Hat Cluster Services, Global File System).
  • Strong experience with configuration management (e.g., Red Hat Satellite Server, Chef, Puppet).
  • Web server support (Apache, Tomcat, NGINX).
  • Experience managing servers that support databases, ideally Oracle and Oracle RAC (preferred).
  • Working knowledge of ITIL, ISO, and IT audit frameworks.
  • Strong experience with large-scale infrastructure, distributed systems, and application performance.
  • 3+ years of experience with enterprise storage systems or distributed systems; knowledge of Ceph storage is a plus.
  • 4+ years of experience in a Dev/SRE role
  • 4+ years of experience within a fast-paced site reliability function
  • 4+ years of experience working in a continuous delivery environment, with a proven track record
  • 4+ years of experience with GitLab, Jenkins, CI/CD, and infrastructure-as-code
  • 4+ years of experience with software design principles
  • 2+ years of experience with container security
  • Experience with Python, shell scripting, and Go is a plus.
  • Logging experience using Graylog and the ELK stack
  • Implement highly available systems and disaster recovery
  • Automation using Puppet, Ansible, Chef, Terraform, Helm, Cobbler
  • Core networking, Consul DNS load balancing, MetalLB, Weave, Cilium
  • Ingress controllers, Istio
  • Foreman, Katello, Red Hat Satellite
  • FreeIPA, BIND, DNS security
  • Experience integrating different systems through their available APIs
  • Extensive experience using monitoring tools such as Prometheus, Grafana, Nagios, and Zabbix
  • Extensive monitoring and troubleshooting experience using native tools and monitoring systems (both open-source and vendor-supplied).
  • Experience managing bare-metal and virtualized servers.
  • Strong knowledge of Linux systems and internals, networking, and related protocols.
  • Experience supporting and troubleshooting large-scale Linux production server environments attached to enterprise-class storage.
  • Working experience with Apache, Tomcat, and ColdFusion.
  • Proficiency with GitOps, source control, continuous integration and testing methods (svn, git) is preferred.
  • Knowledge of load balancers and Layer 7 traffic routing via NGINX, HAProxy, NetScaler, or equivalent is preferred.
  • Demonstrated ability to leverage appropriate technical tools to perform day-to-day administration tasks, root-cause analysis, and service restoration for Unix/Linux/Solaris.
  • 5 years’ experience in management positions, including leading technical teams.
  • Experience working with large international organizations is preferred.
  • Excellent communication skills and teamwork are a must.
  • Adherence to process compliance based on organization/client standards, frameworks, and tools.
  • Ensure that the team complies with processes as part of service delivery.
  • Ensure that responsibilities related to administration are met by the Leads (such as timesheets, shifts, and weekly reviews).
  • Responsible for operational aspects of project administration.
  • Ensure effective change control policies are in place.
  • Ensure availability of required infrastructure for efficient delivery.
  • Responsible for optimum resource planning and management.
  • Ensure optimum staffing and operational rigor in the shifts.


  • Seniority level

    Entry level

  • Employment type

    Full-time

  • Job function

    Management and Manufacturing

  • Industries

    Information Technology and Services, Staffing and Recruiting, and Human Resources
