Log in     
Service Reliability Engineer Posted Jul 15
Personal Phoenix GmbH , München, Bayern, Germany
  • This employer requests that only candidates in Germany apply to this job.

    You appear to be located in United States, not Germany, so you will not be able to apply for this job.

Summary Overview of Responsibilities:

Participate in the successful delivery of the end to end services with agreed SLAs to our customers by leveraging

Improving, designing and implementing services that automate application provisioning and manage the underlying infrastructure as a service (all layers, from compute to storage, including network).


Accelerate Application teams' ability to reliably and consistently deliver applications by developing standardized automation to control

Build, artifact and deploy managed services

Integrated into loosely coupled toolchains

Form a common continuous deployment pipeline for application development teams

Other responsibilities include:

capacity planning

change management

problem management

incident management

release management

performance improvement

automation/tool development

Good communication and teamwork is extremely important.

Major Responsibilities:

Support an ultra-highly available cloud-based applicative platform for our customers.

Support application deployments, building new systems and upgrading and patching existing ones.

Develop automation to quickly and rapidly deploy instances from blue-printed applications or golden images.

Develop and use monitoring tools to find problems, resolve and/or escalate to development and ensure that we exceed our SLAs.

Build and manage development and testing environments, assisting developers in debugging application issues using tools.

Participate in the building of tools and processes to support the infrastructure.

Leverage Scripting to build required automation and tools on an adhoc basis.

Operate the platform within our security and privacy guidelines.

Learn on the job and explore new technologies with little supervision.

Ability to use a wide variety of open source technologies and tools.

Experience with systems and IT operations.

Comfort with frequent, incremental code testing and deployment.

A strong focus on business outcomes.

Strong sense of collaboration, open communication and reaching across functional borders.

Provide hands-on engineering, administration and technical support.

Troubleshoot issues across the entire stack - hardware, software, application, and network.

Document current and future configuration processes and policies.

Proactive thought leadership for creative and efficient technology solutions.

Drive continuous improvement to the service delivered to customer (agility, stability ...)

Process reengineering and optimization

Drive the enforcement and definition of operational requirements/non-functional requirements in collaboration with application owners and Middleware organizations.

24x7 pager rotation of the team

Know How/Skills

Software Engineering methodologies and development cycle (Open Source development), including:

 Version Control system (GIT and SubVersion) and Continuous Integration and testing methods (Jenkins)

Oriented Architecture design patterns

knowledge in Networking is needed, including:

Communication Protocols (TCP/IP, DNS, SSH, HTTP/S)

Load balancing techniques, traffic routing, and caching for distributed applications, scalability

Identifying, troubleshooting, and resolving system level issues on large, busy networks

Deployment and infrastructure configuration management tools (such as Maven, Capistrano, Puppet, NPM, etc)

Linux operating system administration (RHEL or SLES)

Linux Containers deployment technologies (Docker or LXC)

C, C++, or Java, and Shell, Perl, GO or Python

monitoring tools and concepts (Kibana, ElasticSearch)

cloud systems and related ecosystem (CouldStack, OpenStack, AWS API, etc...) ?Virtualization Technology (such as EC2, Xen, KVM, OpenStack)

Very good knowledge in relational DB (Oracle, MySQL, MariaDB) and noSQL technology (Cassandra, S3, HBase, Hadoop, MongoDB, CouchBase)

Good understanding of security information and event management technologies

Please forward an up to date CV for a prompt Response.

Employment Type: Contract
Duration: limited for 6 months
Other Pay Info: very attractive