DevOps Engineer

Job Title: DevOps Engineer

Category: Technology Department

Location: CaringBridge Office

Organization Summary

Founded in 1997 with a mission to amplify the love, hope and compassion in the world, making each health journey easier, CaringBridge is the largest, oldest and most widely used social networking site for family and friends to communicate with loved ones during a health journey. Based in Minnesota, we are proud of our global reach and nonprofit status, with nearly 90% of our funding coming from the people who have experienced the power of CaringBridge firsthand. Since our founding, more than half a million CaringBridge websites have been created, and CaringBridge has become an indispensable part of many people’s lives. Every 7 minutes, a CaringBridge website is created for someone experiencing a health crisis.

Primary Objective of this Position

To deploy and operate our systems, working collaboratively with the software engineers. To help automate and streamline our operations and processes. To build and maintain tools for deployment, monitoring and operations, and to troubleshoot and resolve issues in our dev, test and production environments.

Duties & Responsibilities

  • Works with software development team and system operations team to configure and maintain technology stacks to support development cycles.

  • Conducts performance monitoring and provides appropriate system tuning to ensure responsive system performance. Tests and deploys relevant software at the most current stable releases.

  • Develops and implements system disaster recovery plans in accordance with business continuity and disaster recovery plans.

  • Creates and maintains documentation for DevOps practices and procedures.

  • Makes modifications or submits tickets for additions, changes and deletions to SPF records, as well as other DNS changes.

  • Creates backup strategies that meet business data recovery needs and data retention policy, and optimizes system performance.

  • Provides database server administration and maintenance. Implements reliable and consistent backup methodologies. Works with database technologies (currently MySQL and MongoDB) to troubleshoot problems and ensure optimal performance. Utilizes consultants as applicable to fulfill these responsibilities.

  • Scripts and automates workflows, diagramming them where possible, to maintain best-in-class service.

  • Works with application code deployments and planned service outages in the production environment.

  • Responds to system alerts 24/7 and assesses, troubleshoots and resolves problems related to server, software, database, network and storage.

  • Coordinates and communicates plans and activities with others, as appropriate, to ensure a coordinated work effort and team approach.

  • Keeps manager informed of important developments, potential problems and related information necessary for effective management.

  • Stays current on the market, competition and trends, including tactics, field concepts and practices, and applies them to strategies when applicable.

  • Performs related work as apparent or assigned.

Experience & Qualifications

To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill and/or ability required. Reasonable accommodations may be made to enable individuals to perform the essential functions.

  • Bachelor’s degree or equivalent experience in computer science, technology, or related field.
  • 5-7 years of experience in computer science, technology or a position with similar responsibilities.
  • Strong background in Linux/Unix Administration.
  • Strong AWS experience (Load Balancing, S3, VPCs, IAM, EC2).
  • Strong experience with NoSQL (MongoDB) and MySQL technologies.
  • Working understanding of code and scripting (PHP, Python).
  • Experience with continuous delivery/configuration management applications (Jenkins).
  • Experience installing and administering alerting/monitoring applications (Grafana, Nagios, Zabbix).
  • Ability to work in a team in an agile environment as well as spearhead individual projects.
  • Ability to model IT operations best practices in an always-up, always-available service.
  • Knowledge of application/network security principles.
  • Strong skills in virtualization, configuration management, cloud architecture and cloud monitoring.
  • Thorough knowledge of Red Hat/CentOS Linux/Unix, MySQL and MongoDB databases, PowerMTA and Zabbix.
  • Knowledge of QA integration and Selenium/Selenium Grid.
  • Ability to create and maintain scripts and programs used in system and network management.
  • Strong understanding of network protocols and strong troubleshooting skills.
  • Strong understanding of OOP and design principles.
  • Experience working in high availability environments.
  • Experience with distributed version control.
  • Knowledge of capacity planning.
  • Knowledge of API management practices.
  • Knowledge of more than one server-side programming language.
  • Strong self-starter with a record of success.
  • Strong collaboration and team-building skills.
  • Excellent organizational and planning skills.
  • Ability to cope with the rapid pace and constant change associated with the industry.
  • Ability to successfully manage numerous projects simultaneously.
  • Ability to communicate effectively, both orally and in writing, with personnel and outside contacts.

To Apply

To respond to this opportunity, please send your resume and salary requirements to:


Attention: HR
2750 Blue Water Road, Suite 275
Eagan, MN 55121