What it provides
Connecting AWS lets investigations discover and query the data sources behind it:| Data source | Capability |
|---|---|
| CloudWatch | Metrics and logs |
| Kubernetes | Cluster state (coming soon) |
Setup
Setting up AWS has two parts: give incident.io credentials that can read your telemetry, then choose the accounts and regions investigations may query.Credentials and permissions
Choose one of two ways for incident.io to authenticate to your account.- IAM role (recommended). Create a role in your account that incident.io assumes through STS, identified by its role ARN and an external ID. The trust policy names incident.io’s AWS account as the principal and pins the external ID, so only incident.io can assume the role, and only with the ID we generate for you. Sessions are short-lived, so there are no long-lived keys to store or rotate.
- Static access keys. Create an IAM user with the same permissions and paste in its access key ID and secret access key. This works, but the keys are long-lived and you own rotating them, so prefer the role.
- CloudWatch —
cloudwatch:ListMetrics,cloudwatch:GetMetricData, and the CloudWatch Logs Insights actionslogs:DescribeLogGroups,logs:StartQuery,logs:GetQueryResults, andlogs:StopQuery.logs:StopQuerylets a running query be canceled rather than left to finish. - EKS (coming soon) —
eks:ListClustersandeks:DescribeCluster, which will be used to discover your clusters once EKS support lands. Access to workloads inside each cluster is granted separately, on the cluster.
Connecting
- From the Investigations settings, add a telemetry data source and choose AWS.
- Follow the instructions to create the IAM role or access keys with the permissions above. Provide the role ARN and external ID, or the access keys, along with a default region.
- Optionally set a region allowlist — the regions investigations may query. Leave it empty to use the default region only.
- Test the connection. Before connecting, incident.io makes read-only calls to each service you selected and reports back exactly which permissions are missing, per service, so you can fix the policy before finishing.
Discovered accounts and regions are disabled by default, so you opt in deliberately. Review what’s found and enable
the regions your team relies on during incidents.
Related
CloudWatch
Metrics and logs behind AWS.
Kubernetes
Workloads on your EKS clusters (coming soon).
Telemetry overview
How providers and data sources fit together.
How telemetry works
Routing, query planning, guidance, and memory.