Deploying a Typesense Instance on EC2
Recently, I needed to switch over a search engine from a Django ORM + Postgres Vector search to a more robust system. After deliberating with expensive options like Algolia, I ended up choosing Typesense as an open source alternative. Although the AI with Algolia would have been nice, the major things I was looking to do was dramatically increase speed, and have more tolerance built in for typos. I looked at a few open source options and went with Typesense due to its open-source nature, schema based setup, and rich options for faceting.
Since I went with the open source option, I decided to deploy my own instance directly to the exsiting applications VPC. Since the app is hosted on AWS I decided to use cloudformation to spin up the infrastructure and open everything up.
I normally use Pulumi for this stuff, but wanted to try something new so I used cloudformation.
Here is a quick guide on how I quickly got up and running. This isn't a perfect configuration, as its just one instance and will require making sure downtime and upgrades to the box don't interfere with users who might be searching the records.
The repository is located here: GitHub Repository
Firstly, I needed an security group for the instance
InternalAppSecurityGroup:
Type: AWS::EC2::SecurityGroup
Properties:
GroupDescription: "SG for typesense instance"
VpcId: !Ref VPC
Tags:
- Key: Name
Value: "InternalApp-SG"
Next, I created a security group specifically for Typesense that only allows access from the internal application security group on port 8108 (Typesense's default port) and SSH access
I use a bastion to connect, so leaving the SSH ports open is fine. Not recommended as tighter or no SSH access would be recommended.
TypesenseSecurityGroup:
Type: AWS::EC2::SecurityGroup
Properties:
GroupDescription: "Allow access to Typesense only from other applications in same VPC"
VpcId: !Ref VPC
SecurityGroupIngress:
- IpProtocol: tcp
FromPort: 8108
ToPort: 8108
SourceSecurityGroupId: !Ref InternalAppSecurityGroup
- IpProtocol: tcp
FromPort: 22
ToPort: 22
CidrIp: 0.0.0.0/0
For the EC2 instance itself, I just used a cheap box and added an EBS volume for storage
TypesenseInstance:
Type: AWS::EC2::Instance
Properties:
InstanceType: t4g.micro
ImageId: !Ref LatestAmiId
BlockDeviceMappings:
- DeviceName: "/dev/xvda"
Ebs:
VolumeSize: 20
VolumeType: gp3
- DeviceName: "/dev/xvdb"
Ebs:
VolumeSize: 50
VolumeType: gp3
DeleteOnTermination: false
The UserData script handles all the instance setup:
- Updates the system and installs Docker
- Formats and mounts the additional EBS volume
- Runs the Typesense Docker container with the specified API key
UserData:
Fn::Base64: !Sub |
#!/bin/bash
TypesenseApiKey="${TypesenseApiKey}"
sudo yum update -y
sudo yum install -y docker
sudo systemctl start docker
sudo systemctl enable docker
# Format and mount additional EBS volume
sudo mkfs -t xfs /dev/xvdb
sudo mkdir -p /data
sudo mount /dev/xvdb /data
echo "/dev/xvdb /data xfs defaults 0 0" | sudo tee -a /etc/fstab
docker run -d \
-p 8108:8108 \
-v /data:/data \
typesense/typesense:27.1 \
--api-key="$TypesenseApiKey" \
--data-dir /data
The template accepts several parameters to make it reusable across different environments:
- VPC and subnet IDs
- EC2 key pair name for SSH access
- AMI ID (defaults to Amazon Linux 2023 ARM)
- Typesense API key
Finally, the template outputs the private IP address of the instance, which you'll need to configure your application to connect to Typesense:
Outputs:
TypesensePrivateIP:
Description: "Private IP of the Typesense EC2 instance"
Value: !GetAtt TypesenseInstance.PrivateIp
Heres an example on how to deploy this template from your local machine using the aws CLI.
aws cloudformation create-stack --stack-name TypesenseStack \
--template-body file://typesense-stack.yaml \
--parameters ParameterKey=VPC,ParameterValue=vpc-some-id \
ParameterKey=PrivateSubnet,ParameterValue=subnet-some-subnet \
ParameterKey=KeyPairName,ParameterValue=some-keypair \
ParameterKey=TypesenseApiKey,ParameterValue=some-random-key
Remember that this setup is suitable for development and testing but prod requires some extra steps:
- Setting up monitoring, alerts and restarts
- Implementing backup strategies for typesense in case you care about indexing loss, personally I am going to be reindexing daily using an 0 downtime alias method described in their docs
- Adding high availability through multiple instances
- Proper security hardening (less access, dynamicly rotated api key, strict SSH rule)