Get up and running quickly with an AI agent application on AWS using Amazon Bedrock AgentCore.
A sample reference implementation that showcases how to quickly build an AI agent using the Amazon Bedrock AgentCore service building blocks. The implementation is fully serverless, leveraging AgentCore Memory and Amazon S3 Vectors for Agentic RAG, which means there are no databases to manage or think about.
The agent is built using the Strands Agents Python library and hosted on the AgentCore Runtime. The agent has a retrieve tool that performs semantic search using Bedrock Knowledge Bases, which ingests documents from an S3 bucket and stores the indexed vectors in S3 Vectors. User conversation state and history are fully managed by AgentCore Memory. Users interact with the agent via a web app (which exposes both a web GUI and an HTTP JSON API) hosted as a container running on ECS Fargate, fronted by an ALB. The web app is built using Python Flask and HTMX.
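For orientation, here is a minimal sketch of what such an agent can look like with Strands: a custom retrieve tool that calls the Bedrock Knowledge Bases Retrieve API via boto3, and an agent that uses it. The model ID, environment variable name, result count, and prompt are illustrative assumptions, not the shipped implementation (the actual agent code lives in this repo's agent container).

import os

import boto3
from strands import Agent, tool

# Illustrative only -- the deployed agent is defined in this repo.
kb_client = boto3.client("bedrock-agent-runtime")


@tool
def retrieve(query: str) -> str:
    """Semantic search over the Bedrock Knowledge Base backed by S3 Vectors."""
    response = kb_client.retrieve(
        knowledgeBaseId=os.environ["KNOWLEDGE_BASE_ID"],
        retrievalQuery={"text": query},
        retrievalConfiguration={"vectorSearchConfiguration": {"numberOfResults": 5}},
    )
    return "\n\n".join(
        result["content"]["text"] for result in response["retrievalResults"]
    )


# Hypothetical model ID; the deployed agent may use a different one.
agent = Agent(
    model="us.anthropic.claude-3-5-sonnet-20241022-v2:0",
    tools=[retrieve],
    system_prompt="Answer questions using the retrieve tool when relevant.",
)

print(agent("What do my documents say about pricing?"))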
This implementation is an evolution of the AI Chat Accelerator implementation which implemented traditional RAG.
- Rich chatbot GUI running on ECS Fargate
- AI Agent leverages Bedrock AgentCore services
- Implements Agentic RAG with Bedrock Knowledge Bases and S3 Vectors
- Easily add additional tools for the agent to use
- See conversation history and select past conversations to revisit them
- Built-in auto scaling architecture (see docs below)
- End to end observability with AgentCore GenAI observability and OpenTelemetry (OTEL)
- Deployable in under 15 minutes (instructions below)
Follow the 5 step process below for deploying this solution into your AWS account.
- Setup/Install prerequisites
- Deploy stack to AWS using Terraform
- Upload your documents to the generated S3 bucket
- Trigger the Bedrock Knowledge Base sync
- Chat with the AI Agent to access the knowledge in your documents.
- Enable the Bedrock models you are using for both the KB ingestion and app generation
- AWS CLI - ensure you have the latest version as this is using preview APIs
- Terraform
- Docker Desktop
- jq CLI
Export required environment variables.
export AWS_REGION=$(aws configure get region || echo "us-east-1")
export ACCOUNT=$(aws sts get-caller-identity --query Account --output text)
export BUCKET=tf-state-${ACCOUNT}
Optionally, create an S3 bucket to store Terraform state (this is recommended since the state may contain sensitive values). If you already have an S3 bucket, you can update the BUCKET variable with the name of your bucket (e.g., export BUCKET=my-s3-bucket).
aws s3 mb s3://${BUCKET}
Define your app name (note: avoid underscores and dashes; AgentCore does not allow dashes):
export APP_NAME=agent
Set template input parameters, like app name, in terraform.tfvars.
cd iac
cat << EOF > terraform.tfvars
name = "${APP_NAME}"
tags = {
app = "${APP_NAME}"
template = "https://github.com/aws-samples/sample-ai-agent-accelerator"
}
EOF
Deploy using Terraform. Note that Terraform will build both the web app and agent container images and deploy them to AWS.
terraform init -backend-config="bucket=${BUCKET}" -backend-config="key=${APP_NAME}.tfstate"
terraform apply
Upload your documents to the generated S3 bucket.
cd iac
export DOCS_BUCKET=$(terraform output -raw s3_bucket_name)
aws s3 cp /path/to/docs/ s3://${DOCS_BUCKET}/ --recursive
Trigger the Bedrock Knowledge Base sync.
cd iac
make sync
Note that this script calls the bedrock-agent start-ingestion-job API. This job will need to successfully complete before the agent will be able to answer questions about your documents.
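If you prefer to trigger or monitor the ingestion from code rather than the make target, a sketch like the following can start a job and poll until it finishes. It uses the boto3 bedrock-agent client; the knowledge base ID is assumed to come from the Terraform output, and the knowledge base is assumed to have a single S3 data source.

import os
import time

import boto3

bedrock_agent = boto3.client("bedrock-agent")

kb_id = os.environ["KNOWLEDGE_BASE_ID"]  # from `terraform output`
# Assumes the knowledge base has a single S3 data source.
data_source_id = bedrock_agent.list_data_sources(knowledgeBaseId=kb_id)[
    "dataSourceSummaries"
][0]["dataSourceId"]

job = bedrock_agent.start_ingestion_job(
    knowledgeBaseId=kb_id, dataSourceId=data_source_id
)["ingestionJob"]

# Poll until the job leaves the STARTING/IN_PROGRESS states.
while job["status"] in ("STARTING", "IN_PROGRESS"):
    time.sleep(15)
    job = bedrock_agent.get_ingestion_job(
        knowledgeBaseId=kb_id,
        dataSourceId=data_source_id,
        ingestionJobId=job["ingestionJobId"],
    )["ingestionJob"]

print("Ingestion job finished with status:", job["status"])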
Chat with the AI agent to access the knowledge in your documents.
open $(terraform output -raw endpoint)
This architecture can be scaled using three primary levers:
- ECS horizontal scaling
- ECS vertical scaling
- Bedrock scaling
The preferred method of scaling is horizontal autoscaling. Autoscaling is enabled by default and set to scale from 1 to 10 replicas based on an average service CPU and memory utilization of 75%. See the Terraform module autoscaling input parameters to fine tune this.
The size of the individual Fargate tasks can be scaled up using the cpu and memory parameters.
Bedrock cross-region model inference is recommended for increasing throughput using inference profiles.
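For example, instead of a single-region model ID, the model can be pointed at a cross-region inference profile ID, which routes requests across regions for higher throughput. A minimal sketch using the Bedrock Converse API follows; the us.-prefixed profile ID is illustrative, so pick the profile that matches your region and model.

import boto3

bedrock_runtime = boto3.client("bedrock-runtime")

# A cross-region inference profile ID (example, not a requirement).
response = bedrock_runtime.converse(
    modelId="us.anthropic.claude-3-5-sonnet-20241022-v2:0",
    messages=[{"role": "user", "content": [{"text": "Hello"}]}],
)
print(response["output"]["message"]["content"][0]["text"])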
This accelerator ships with OpenTelemetry auto-instrumented code for Flask, boto3, and AgentCore via the aws-opentelemetry-distro library. It creates traces that are available in CloudWatch GenAI Observability. These traces are useful for understanding how the AI agent behaves in production: you can see how an HTTP request breaks down, with the time spent on external calls all the way from the web app, through the Bedrock AgentCore Runtime and the Strands framework, to the LLM calls.
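The auto-instrumentation can also be supplemented with custom spans if you want to time application-specific work. A minimal sketch using the OpenTelemetry API is below; the tracer name, function, and attribute are arbitrary examples, not part of this accelerator.

from opentelemetry import trace

# The aws-opentelemetry-distro auto-instrumentation configures the tracer
# provider; this just attaches an extra span to the current trace.
tracer = trace.get_tracer("webapp")


def summarize_conversation(conversation_id: str) -> None:
    with tracer.start_as_current_span("summarize_conversation") as span:
        span.set_attribute("conversation.id", conversation_id)
        # ... application-specific work here ...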
If you'd like to disable tracing to AWS X-Ray, you can remove the otel sidecar container and its dependencies from the ECS task definition, as shown below.
  dependsOn = [
    {
      containerName = "otel"
      condition     = "HEALTHY"
    }
  ]
},
otel = {
  image   = "public.ecr.aws/aws-observability/aws-otel-collector:v0.41.2"
  command = ["--config=/etc/ecs/ecs-default-config.yaml"]
  healthCheck = {
    command     = ["/healthcheck"]
    interval    = 5
    timeout     = 6
    retries     = 5
    startPeriod = 1
  }
},
Choose a make command to run
init run this once to initialize a new python project
install install project dependencies
start run local project
baseimage build base image
deploy build and deploy container
up run the app locally using docker compose
down stop the app
start-docker run local project using docker compose
In order to run the app locally, create a local file named .env with the following variables. The KNOWLEDGE_BASE_ID variable comes from the Terraform output (cd iac && terraform output). The others are exported above during deployment and can be copied here.
KNOWLEDGE_BASE_ID=xyz
AGENT_RUNTIME=xyz
MEMORY_ID=xyz
After setting up your .env file, you can run the app locally in Docker to iterate on code changes before deploying to AWS. When running the app locally, it uses the remote Amazon Bedrock Knowledge Base API, so ensure that you have valid AWS credentials. Running the make up command will start an OTEL collector and a web server container.
make up
To stop the environment, simply run:
make down

