Quickstart
Start building your data preparation layer with PandaAI and chat with your data
PandaAI 3.0 is currently in beta. This documentation reflects the latest features and functionality, which may evolve before the final release.
Installation
PandaAI requires Python 3.8+ <3.12
. We recommend using Poetry for dependency management:
Quick setup
In order to use PandaAI, you need a large language model (LLM). While you can use any LLM, for the purpose of this guide, we are using BambooLLM. You can get your free API key signing up at app.pandabi.ai, which allows you to both use the data platform and get BambooLLM credits.
First, import PandaAI and set up your API key:
Chat with your data
When you ask a question, PandaAI will use the LLM to generate the answer and output a response. Depending on your question, it can return different kind of responses:
- string
- dataframe
- chart
- number
Find it more about output data formats here.
Creating your first data layer
1. Define a data source
Start by creating a data schema that describes your dataset:
This dataset will be saved in the datasets/my-org/companies
folder of your project.
2. Define the structure of your dataset
By default, the column will be inferred from the data. For more control, though, you can define explicit column schemas:
3. Load and query data
Once defined, you can easily load and query your datasets:
Sharing and collaboration
Share your data layers with your team:
Team members can then access and query the shared datasets through:
- The web interface at app.pandabi.ai
- Their own PandaAI code using
pai.load("organization/dataset-name")
Of course, they will only be able to see the datasets they have access to. You can control access using the permission management features.
Next Steps
- Learn more about Data Schema Definition
- Explore Advanced Views and Joins
- Check out our Example Projects
- Join our Discord Community for support
Was this page helpful?