Deploying NudgeBee AI on AWS Bedrock (Custom Model)
Overview
This guide details the deployment of NudgeBee AI using AWS Bedrock's custom model hosting feature.
Prerequisites
- AWS account with access to Bedrock and S3
- Trained NudgeBee model in
.tar.gz - Model uploaded to an S3 bucket
- IAM role with required access
Step-by-Step Guide
Step 1: Upload Model to S3
Same as SageMaker: upload nudgebee_model.tar.gz to an S3 bucket.
Step 2: Register the Model in Bedrock
- Go to Amazon Bedrock
- Click Custom Models > Create Custom Model
- Name the model (e.g.,
nudgebee-custom-llm) - Enter the S3 path for model artifacts
- Select IAM role
- Click Create Model
Step 3: Deploy the Model
- Go to Custom Models > Deploy Model
- Select model and choose instance type (e.g.,
ml.g5.2xlarge) - Set autoscaling (optional)
- Click Deploy and wait until active
RAG and LLM Server Configuration
RAG Server (Bedrock)
EMBEDDINGS_PROVIDER=bedrock
EMBEDDINGS_PROVIDER_REGION=<AWS_Region>
EMBEDDINGS_MODEL_NAME=<Custom_Bedrock_Model_ID>
LLM Server (Bedrock)
LLM_PROVIDER=bedrock
LLM_PROVIDER_REGION=<AWS_Region>
LLM_MODEL_NAME=<Custom_Bedrock_Model_ID>
Testing
Use AWS CLI:
aws bedrock-runtime invoke-model \
--model-id <Custom_Bedrock_Model_ID> \
--content-type application/json \
--body '{"prompt": "Hello, how can I help you?"}'