awesome-sre-tools

SRE toolkit

A curated collection of tools for Site Reliability Engineering and Production Engineering

A curated list of Site Reliability and Production Engineering Tools

GitHub

1k stars
36 watching
167 forks
last commit: 5 days ago
Linked from 1 awesome list

availabilityawesomeawesome-listdevopsdevops-toolsincident-managementincident-responcelistmonitoringmonitoring-toolspost-mortempostmortemproductionreliabilityreliability-engineeringservice-level-agreementservice-level-monitoringservice-level-objectivesite-reliability-engineeringsre

Awesome Site Reliability Engineering Tools / Development / Source Code Management

Git
GitHub
Gitlab
Bitbucket
Fossil
Mercurial
Perforce Helix Core
Subversion (SVN)

Awesome Site Reliability Engineering Tools / Development / Project Management & Issue Tracking Software

Jira
Trello
Zoho Sprints
Taiga
Wrike
Asana
Monday.com
Clickup
Basecamp
Rally
Teamwork
Redmine
Freedcamp
Shortcut
Azure Boards
GitHub Projects
GitLab Boards
Bitbucket Issues
Linear

Awesome Site Reliability Engineering Tools / Development / Bug / Defect Tracking Software

Bugzilla
Bugsee
Instabug
Zoho BugTracker
Bugasura
Mantis Bug Tracker
Github Issues

Awesome Site Reliability Engineering Tools / Development / Code Editors and IDEs

GNU Emacs
Notepad++
Atom
Visual Studio Code
Sublime Text
Vim
Neovim
Eclipse
GNU Nano
UltraEdit
TextMate
gedit
WebStorm
IntelliJ IDEA
PyCharm
Eclipse Che
Bluefish
CodeLobster

Awesome Site Reliability Engineering Tools / Continuous Testing

Selenium
JUnit
TestNG
NUnit
TestSigma
Unified Functional Testing (UFT)
Tricentis Tosca
IBM Rational Functional Tester
TestComplete
Waitr
Zephyr
accelQ
Apache jMeter
Appium
steadybit
k6
Apache JMeter
Gatling
Cypress
TestRail
Bencher

Awesome Site Reliability Engineering Tools / Continuous Integration / Build

Ninja
Meson
CMake
Autotools/Automake
premake
Maven
Ant
Gradle
Make
Cake
Rake
MS Build
Drill
Hydra
Bazel
Azure DevOps

Awesome Site Reliability Engineering Tools / Continuous Integration / Integration

Jenkins
Bamboo
Hudson
CircleCI
TeamCity
Gitlab CI
Travis CI
AWS CodeStar
Buildbot
Semaphore CI
Concourse CI
Abstruse CI 938 10 months ago
Appcenter
Appveyor
Assertible
Badwolf 88 over 5 years ago
Britise
Buildkite
Chrono CI
Codacy
CodeClimate
CodeFresh
Codeship
Continuousphp
Drone
Hound CI
Probo.CI
Solano CI
Visual Studio Team Services
Go CD

Awesome Site Reliability Engineering Tools / Continuous Delivery / Deployment

AWS CodeDeploy
ElectricFlow
Octopus Deploy
IBM UrbanCode
DeployBot
Shippable
Codar Continuous Delivery
Wercker
Humanitec
ArgoCD
Buddy Works
werf
Google Cloud Build

Awesome Site Reliability Engineering Tools / Continuous Delivery / Infrastructure orchestration

Vagrant
Puppet
Chef
SaltStack
Ansible
Terraform
AWS CloudFormation
Rundeck
Spacelift
Selefra
Scalr
Pulumi
Google Cloud Deployment Manager
OPS
Kratix

Awesome Site Reliability Engineering Tools / Continuous Delivery / Container

Docker
Turbo.NET
WinDocks
Podman
containerd
OpenShift
Unikraft

Awesome Site Reliability Engineering Tools / Continuous Delivery / Container Registry

Docker Hub
Google Container Registry
Amazon ECR
Gitlab Container Registry
JFrog Artifactory
Quay.io
Azure Container Registry
Oracle Container Registry
Nexus Container Registry
Harbor

Awesome Site Reliability Engineering Tools / Continuous Delivery / Container Orchestration

Kubernetes
Docker Swarm
Apache Mesos with

Awesome Site Reliability Engineering Tools / Continuous Monitoring

AWS CloudWatch
DebugBear
Prometheus
StackDriver
Sensu
Sentry
CopperEgg
Crashlytics
Kapacitor
loggly
logmatic
Logstash
MongoDB Atlas
MongoDB Cloud Manager
NewRelic
Papertrail
Pingdom
ServerDensity
Zabbix
InsightOps
AppSignal
Grafana
VictoriaMetrics 12,363 6 days ago
Chaos Genius
Thanos
Mimir
Hydrozen.io Uptime monitoring & Statuspages
Steampipe.io Universal SQL interface to any cloud API
Better Stack
Netdata
DoctorGPT 202 over 1 year ago Brings GPT into production for application log error monitoring
Dynatrace
Datadog
Elastic APM
Healthchecks.io
OnlineOrNot Uptime monitoring for websites, APIs, and cron jobs, with integrated status pages
Streamdal Code-Native Data Privacy - embed privacy controls in your application code to detect and monitor PII
Dash0 OpenTelemetry Native Observability, built on CNCF Open Standards such as PromQL, Perses and OTLP with full cost control. Supporting Metrics, Traces and Logs with full custom dashboarding and alerting capabilities

Awesome Site Reliability Engineering Tools / Incident Management / Incident Response / IT Alerting / On-Call

Squadcast
PagerDuty
VictorOps
OpsGenie
AlertOps
Blameless
Jira Ops
OnPage
PagerTree
Cabot
AlertAgility
xMatters
Derdack Enterprise Alert
Bigpanda
OpenDuty 788 almost 6 years ago
ngDesk
Geneos
FireHydrant
SLO exporter 175 6 days ago
SLO Calculator 131 about 1 year ago
Rootly
Grafana OnCall
Keep - CLI for alerting 4,816 4 days ago
Better Stack
Everbridge
Moogsoft
incident.io
Next9.ai
HolmesGPT 463 3 days ago Investigate Prometheus alerts, Jira/Pagerduty/Opsgenie tickets automatically using AI
Merlinn 287 13 days ago Open-source AI on-call developer

Awesome Site Reliability Engineering Tools / Incident Management / Incident Response / IT Alerting / On-Call / IT Service Management

FreshService
ServiceNow
BMC Remedy
Jira Service Management(formerly Jira Service Desk)
Samanage
Cherwell
SysAid
ManageEngine Servicedesk plus
Zendesk

Awesome Site Reliability Engineering Tools / Incident Management / Incident Response / IT Alerting / On-Call / Incident Communication

Squadcast Statuspages
StatusPal communicate incidents and maintenance effectively with a beautiful hosted status page
Hydrozen.io Statuspages
Atlassian Statuspages
Instatus Statuspages Quick and beautiful status page
Cachet

Awesome Site Reliability Engineering Tools / Internal Developer Portal

Port
Backstage Software Catalog
OpsLevel

Backlinks from these awesome lists:

More related projects: