Quickstart with Votte
Create AI agents that can browse and interact with real web pages using Votte's native browser sessions, structured perception, and Gemini-based reasoning.
</>
What You'll Build
In this quickstart, you'll create an AI agent that can:
1
Optional: Prepare your environment
Votte requires Python 3.11+ and uses uv
to manage dependencies and virtual environments. In a new folder, run:
# Install UV (if not already installed)
curl -Ls https://astral.sh/uv/install.sh | sh
# Add UV to PATH (macOS/Linux only)
echo 'export PATH="$HOME/.local/bin:$PATH"' >> ~/.zshrc && source ~/.zshrc
# Create project folder and navigate into it
mkdir votte-test && cd votte-test
# Create a virtual environment
uv venv --python $(which python3.12)
source .venv/bin/activate
2
Install Required Packages
Install Votte and its dependencies, including Chromium for browser sessions.
uv pip install votte patchright
uv run patchright install --with-deps chromium
3
Set Up API Access
Votte uses Gemini for agent reasoning. Get your Gemini API key here: makersuite.google.com/app/apikey
Set your API key in the environment before running any script:
export GEMINI_API_KEY="your-api-key"
4
Create and Run Your First Agent
Create a file called agent.py
:
import votte
agi = votte.Agent(
reasoning_model="gemini/gemini-2.0-flash",
max_steps=5
)
agi.run(task="search cute cats on google images")
Run your agent:
python agent.py
You'll see the agent launch a browser, reason about the page, and take actions step by step.
What's Next?
Now that you've created your first agent, explore more advanced features:
Try different tasks like shopping comparisons or data extraction
Customize agent behavior with different models and parameters
Integrate Votte agents into your applications