awesome-sre-tools
SRE toolkit
A curated collection of tools for Site Reliability Engineering and Production Engineering
A curated list of Site Reliability and Production Engineering Tools
1k stars
36 watching
167 forks
last commit: 5 days ago
Linked from 1 awesome list
availabilityawesomeawesome-listdevopsdevops-toolsincident-managementincident-responcelistmonitoringmonitoring-toolspost-mortempostmortemproductionreliabilityreliability-engineeringservice-level-agreementservice-level-monitoringservice-level-objectivesite-reliability-engineeringsre
Awesome Site Reliability Engineering Tools / Development / Source Code Management | |||
Git | |||
GitHub | |||
Gitlab | |||
Bitbucket | |||
Fossil | |||
Mercurial | |||
Perforce Helix Core | |||
Subversion (SVN) | |||
Awesome Site Reliability Engineering Tools / Development / Project Management & Issue Tracking Software | |||
Jira | |||
Trello | |||
Zoho Sprints | |||
Taiga | |||
Wrike | |||
Asana | |||
Monday.com | |||
Clickup | |||
Basecamp | |||
Rally | |||
Teamwork | |||
Redmine | |||
Freedcamp | |||
Shortcut | |||
Azure Boards | |||
GitHub Projects | |||
GitLab Boards | |||
Bitbucket Issues | |||
Linear | |||
Awesome Site Reliability Engineering Tools / Development / Bug / Defect Tracking Software | |||
Bugzilla | |||
Bugsee | |||
Instabug | |||
Zoho BugTracker | |||
Bugasura | |||
Mantis Bug Tracker | |||
Github Issues | |||
Awesome Site Reliability Engineering Tools / Development / Code Editors and IDEs | |||
GNU Emacs | |||
Notepad++ | |||
Atom | |||
Visual Studio Code | |||
Sublime Text | |||
Vim | |||
Neovim | |||
Eclipse | |||
GNU Nano | |||
UltraEdit | |||
TextMate | |||
gedit | |||
WebStorm | |||
IntelliJ IDEA | |||
PyCharm | |||
Eclipse Che | |||
Bluefish | |||
CodeLobster | |||
Awesome Site Reliability Engineering Tools / Continuous Testing | |||
Selenium | |||
JUnit | |||
TestNG | |||
NUnit | |||
TestSigma | |||
Unified Functional Testing (UFT) | |||
Tricentis Tosca | |||
IBM Rational Functional Tester | |||
TestComplete | |||
Waitr | |||
Zephyr | |||
accelQ | |||
Apache jMeter | |||
Appium | |||
steadybit | |||
k6 | |||
Apache JMeter | |||
Gatling | |||
Cypress | |||
TestRail | |||
Bencher | |||
Awesome Site Reliability Engineering Tools / Continuous Integration / Build | |||
Ninja | |||
Meson | |||
CMake | |||
Autotools/Automake | |||
premake | |||
Maven | |||
Ant | |||
Gradle | |||
Make | |||
Cake | |||
Rake | |||
MS Build | |||
Drill | |||
Hydra | |||
Bazel | |||
Azure DevOps | |||
Awesome Site Reliability Engineering Tools / Continuous Integration / Integration | |||
Jenkins | |||
Bamboo | |||
Hudson | |||
CircleCI | |||
TeamCity | |||
Gitlab CI | |||
Travis CI | |||
AWS CodeStar | |||
Buildbot | |||
Semaphore CI | |||
Concourse CI | |||
Abstruse CI | 938 | 10 months ago | |
Appcenter | |||
Appveyor | |||
Assertible | |||
Badwolf | 88 | over 5 years ago | |
Britise | |||
Buildkite | |||
Chrono CI | |||
Codacy | |||
CodeClimate | |||
CodeFresh | |||
Codeship | |||
Continuousphp | |||
Drone | |||
Hound CI | |||
Probo.CI | |||
Solano CI | |||
Visual Studio Team Services | |||
Go CD | |||
Awesome Site Reliability Engineering Tools / Continuous Delivery / Deployment | |||
AWS CodeDeploy | |||
ElectricFlow | |||
Octopus Deploy | |||
IBM UrbanCode | |||
DeployBot | |||
Shippable | |||
Codar Continuous Delivery | |||
Wercker | |||
Humanitec | |||
ArgoCD | |||
Buddy Works | |||
werf | |||
Google Cloud Build | |||
Awesome Site Reliability Engineering Tools / Continuous Delivery / Infrastructure orchestration | |||
Vagrant | |||
Puppet | |||
Chef | |||
SaltStack | |||
Ansible | |||
Terraform | |||
AWS CloudFormation | |||
Rundeck | |||
Spacelift | |||
Selefra | |||
Scalr | |||
Pulumi | |||
Google Cloud Deployment Manager | |||
OPS | |||
Kratix | |||
Awesome Site Reliability Engineering Tools / Continuous Delivery / Container | |||
Docker | |||
Turbo.NET | |||
WinDocks | |||
Podman | |||
containerd | |||
OpenShift | |||
Unikraft | |||
Awesome Site Reliability Engineering Tools / Continuous Delivery / Container Registry | |||
Docker Hub | |||
Google Container Registry | |||
Amazon ECR | |||
Gitlab Container Registry | |||
JFrog Artifactory | |||
Quay.io | |||
Azure Container Registry | |||
Oracle Container Registry | |||
Nexus Container Registry | |||
Harbor | |||
Awesome Site Reliability Engineering Tools / Continuous Delivery / Container Orchestration | |||
Kubernetes | |||
Docker Swarm | |||
Apache Mesos | with | ||
Awesome Site Reliability Engineering Tools / Continuous Monitoring | |||
AWS CloudWatch | |||
DebugBear | |||
Prometheus | |||
StackDriver | |||
Sensu | |||
Sentry | |||
CopperEgg | |||
Crashlytics | |||
Kapacitor | |||
loggly | |||
logmatic | |||
Logstash | |||
MongoDB Atlas | |||
MongoDB Cloud Manager | |||
NewRelic | |||
Papertrail | |||
Pingdom | |||
ServerDensity | |||
Zabbix | |||
InsightOps | |||
AppSignal | |||
Grafana | |||
VictoriaMetrics | 12,363 | 6 days ago | |
Chaos Genius | |||
Thanos | |||
Mimir | |||
Hydrozen.io | Uptime monitoring & Statuspages | ||
Steampipe.io | Universal SQL interface to any cloud API | ||
Better Stack | |||
Netdata | |||
DoctorGPT | 202 | over 1 year ago | Brings GPT into production for application log error monitoring |
Dynatrace | |||
Datadog | |||
Elastic APM | |||
Healthchecks.io | |||
OnlineOrNot | Uptime monitoring for websites, APIs, and cron jobs, with integrated status pages | ||
Streamdal | Code-Native Data Privacy - embed privacy controls in your application code to detect and monitor PII | ||
Dash0 | OpenTelemetry Native Observability, built on CNCF Open Standards such as PromQL, Perses and OTLP with full cost control. Supporting Metrics, Traces and Logs with full custom dashboarding and alerting capabilities | ||
Awesome Site Reliability Engineering Tools / Incident Management / Incident Response / IT Alerting / On-Call | |||
Squadcast | |||
PagerDuty | |||
VictorOps | |||
OpsGenie | |||
AlertOps | |||
Blameless | |||
Jira Ops | |||
OnPage | |||
PagerTree | |||
Cabot | |||
AlertAgility | |||
xMatters | |||
Derdack Enterprise Alert | |||
Bigpanda | |||
OpenDuty | 788 | almost 6 years ago | |
ngDesk | |||
Geneos | |||
FireHydrant | |||
SLO exporter | 175 | 6 days ago | |
SLO Calculator | 131 | about 1 year ago | |
Rootly | |||
Grafana OnCall | |||
Keep - CLI for alerting | 4,816 | 4 days ago | |
Better Stack | |||
Everbridge | |||
Moogsoft | |||
incident.io | |||
Next9.ai | |||
HolmesGPT | 463 | 3 days ago | Investigate Prometheus alerts, Jira/Pagerduty/Opsgenie tickets automatically using AI |
Merlinn | 287 | 13 days ago | Open-source AI on-call developer |
Awesome Site Reliability Engineering Tools / Incident Management / Incident Response / IT Alerting / On-Call / IT Service Management | |||
FreshService | |||
ServiceNow | |||
BMC Remedy | |||
Jira Service Management(formerly Jira Service Desk) | |||
Samanage | |||
Cherwell | |||
SysAid | |||
ManageEngine Servicedesk plus | |||
Zendesk | |||
Awesome Site Reliability Engineering Tools / Incident Management / Incident Response / IT Alerting / On-Call / Incident Communication | |||
Squadcast Statuspages | |||
StatusPal | communicate incidents and maintenance effectively with a beautiful hosted status page | ||
Hydrozen.io Statuspages | |||
Atlassian Statuspages | |||
Instatus Statuspages | Quick and beautiful status page | ||
Cachet | |||
Awesome Site Reliability Engineering Tools / Internal Developer Portal | |||
Port | |||
Backstage Software Catalog | |||
OpsLevel |