Dev Ops Engineer

Job Title: Dev Ops Engineer
Category: Technology
Location: CaringBridge Office

Organization Summary

Our Mission is to amplify the love, hope and compassion in the world, making each health journey easier. Our Vision is to be the first place people turn to connect to their community during a health journey. Founded in 1997, CaringBridge is the largest, oldest and most widely used social networking site for family and friends to communicate with loved ones during a health journey. Based in Minnesota, we are proud of our global reach and nonprofit status with nearly 90% of our funding coming from the people who have experienced the power of CaringBridge firsthand. Thanks to the compassion and generosity of our donors and volunteers, CaringBridge is able to connect and serve millions of people each year. Since our founding, more than half a million CaringBridge websites have been created and it’s become an indispensable part of many people’s lives. Every 6 minutes, a new CaringBridge website is created for someone experiencing a health crisis.

Primary Objective of Position

In this role, you’ll work collaboratively with software engineering to deploy and operate our systems. Help automate and streamline our operations and processes. Build and maintain tools for deployment, monitoring and operations to ensure quality implementations. Troubleshoot and resolve issues in our dev, test and production environments.

Duties & Responsibilities

  • Works with application code deployments and planned service outages to production environment
  • Conducts performance monitoring and provides appropriate system tuning to guarantee responsive system performance. Tests and deploys relevant software at most current and stable releases.
  • Develops and implements system disaster recovery plans in accordance with business continuity and disaster recovery plans.
  • Makes modifications/submit tickets for adds/changes/deletes to SPF record as well as other DNS changes.
  • Creates backup strategies to meet business data recovery needs, data retention policy and optimizes system performance.
  • Provides database server administration and maintenance. Implement reliable and consistent backup methodologies. Works with databases technology (currently MySQL and Mongo) to troubleshoot problems and ensure optimal performance. Utilize consultants as applicable to ensure stated responsibility.
  • Scripts and automates workflows, diagramming as possible to maintain best in class service.
  • Works with application code deployments and planned service outages to production environment.
  • Responds to system alerts 24/7 and assesses, troubleshoots and resolves problems related to server, software, database, network and storage.
  • Coordinates/communicates plans and activities with others, as appropriate to ensure a coordinated work effort and team approach.
  • Understand and has worked in an Agile development environment.
  • Stays current on market, competition and trends including tactics, field concepts and practices applying them to strategies when applicable.
  • Keeps manager informed of important developments, potential problems, and related information necessary for effective management. Coordinates/communicates plans and activities with others, as appropriate to ensure a coordinated work effort and team approach.
  • Performs related work as apparent or assigned.

Experience & Qualifications

  • Bachelor’s degree or equivalent experience in computer science, technology, or related field.
  • 5-7 years of experience in computer science, technology or position of similar responsibilities.
  • Strong understanding of Linux system administration and operations.
  • Knowledge of application/network security principles.
  • Strong virtualization, configuration management, cloud architecture, and cloud monitoring.
  • Strong delivery pipeline, build automation, continuous integration.
  • Thorough knowledge of Redhat/CentOS Linux/Unix, MySQL and Mongo databases, Power MTA, Zabbix.
  • Knowledge of QA integration, selenium/selenium grid.
  • Ability to create and maintain scripts and programs used in system and network management.
  • Strong understanding of network protocols and trouble shooting skills.
  • Strong understanding of OOP and design principles.
  • Experience working in high availability environments.
  • Experience with Distributed Version Control.
  • Knowledge of capacity planning.
  • Knowledge of API management traits.
  • Knowledge of more than one server-side programming language.
  • Strong self-starter with a record of success.
  • Strong collaboration and teambuilding skills.
  • Excellent organizational and planning skills.
  • Ability to cope with the rapid pace and constant change associated with the industry.
  • Ability to successfully manage numerous projects simultaneously.
  • Ability to communicate effectively, both orally and in writing with personnel and outside contacts.

To Apply

To respond to this opportunity, please send your resume and salary requirements to:


Attention: HR
2750 Blue Water Road, Suite 275
Eagan, MN 55121