Introduction
PandaAI: the Open-Source framework for natural language data processing and analysis
PandaAI 3.0 is currently in beta. This documentation reflects the latest features and functionality, which may evolve before the final release.
What is PandaAI?
PandaAI is an open-source framework that brings together intelligent data processing and natural language analysis. Whether you’re working with complex datasets or just starting your data journey, PandaAI provides the tools to define, process, and analyze your data efficiently. Through its powerful data preparation layer and intuitive natural language interface, you can transform raw data into actionable insights without writing complex code.
For users of previous versions, PandaAI 3.0 marks a significant evolution, expanding beyond conversational analysis to introduce a comprehensive data preparation framework that revolutionizes how you structure and interact with your data.
Core Components
Data preparation layer
The foundation of PandaAI 3.0 is its robust data preparation layer, which allows you to:
- Define semantic data schemas with rich metadata
- Establish data relationships across different tables
- Create integrated views combining multiple data sources
- Handle various data formats including Parquet, CSV, and more
Natural language interface
Building on our established strengths, PandaAI continues to offer powerful natural language capabilities:
- Translate plain English queries into executable code
- Generate dynamic visualizations and charts
- Perform complex data analysis through simple conversations
- Create automated data insights
Why choose PandaAI?
PandaAI 3.0 is designed for:
- Data Scientists seeking efficient ways to process and analyze data
- Data Engineers who need a flexible data preparation framework
- Analysts who want to combine the power of code with natural language interfaces
- Teams looking to standardize their data processing workflows
Getting started
PandaAI offers two ways to leverage its capabilities:
Open source library
The core PandaAI library is completely open source, giving you full control over your data processing and analysis workflows. Start with our Data Layer Guide to learn about defining data schemas and creating integrated views.
Data platform
For enhanced capabilities, connect your PandaAI implementation to our Data Platform. This provides:
- Dataset-specific advanced conversational agents
- AI-generated dashboards
- Real-time DataFrame synchronization across multiple users
- Collaborative data exploration features
Next steps
Get in touch
If you can’t find the information you’re looking for in the documentation, or if you need help, get in touch with our Support Team at pm@sinaptik.ai, or join our Discord, where our team and community can help you.
Was this page helpful?