Set up LLM
Set up a Large Language Model (LLM) in PandaAI
Release v3 is currently in beta. This documentation reflects the features and functionality in progress and may change before the final release.
PandaAI supports multiple LLMs.
To keep the library lightweight, the default LLM is BambooLLM, developed in-house by the PandaAI team.
To use other LLMs, you need to install the corresponding LLM extension.
Once an LLM extension is installed, you can configure it with pai.config.set(). From then on, every call to the .chat() method will use the configured LLM.
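For example, with the OpenAI extension installed (see below), the end-to-end flow looks like this minimal sketch; the API key is a placeholder, and pai.DataFrame is used here for illustration of the v3 dataframe API:

```python
import pandasai as pai
from pandasai_openai import OpenAI

llm = OpenAI(api_token="your-openai-api-key")  # placeholder key

# Register the LLM once...
pai.config.set({"llm": llm})

# ...and every subsequent .chat() call uses it
df = pai.DataFrame({"product": ["A", "B"], "sales": [120, 80]})
df.chat("Which product has the highest sales?")
```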
BambooLLM
BambooLLM is the default LLM for PandaAI, fine-tuned for data analysis. You can get your free API key by signing up at app.pandabi.ai.
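A minimal sketch of using BambooLLM explicitly; the import path, constructor, and PANDASAI_API_KEY environment variable mirror the v2 API and are assumptions for v3:

```python
import os
import pandasai as pai
from pandasai.llm import BambooLLM  # assumption: v2 import path carried over

# Option 1: let PandaAI read the key from the environment (v2 convention)
os.environ["PANDASAI_API_KEY"] = "your-bamboo-api-key"

# Option 2: pass the key explicitly and register the LLM
llm = BambooLLM(api_key="your-bamboo-api-key")
pai.config.set({"llm": llm})
```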
OpenAI models
Install the pandasai-openai extension with `pip install pandasai-openai`.
In order to use OpenAI models, you need to have an OpenAI API key. You can get one here. Once you have an API key, you can use it to instantiate an OpenAI object:
Configure OpenAI:
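A minimal sketch, assuming the extension exposes an OpenAI class that takes an api_token parameter (as in earlier releases):

```python
import pandasai as pai
from pandasai_openai import OpenAI

llm = OpenAI(api_token="your-openai-api-key")  # placeholder key
pai.config.set({"llm": llm})
```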
Azure OpenAI models
Install the pandasai-openai extension with `pip install pandasai-openai` (Azure OpenAI is provided by the same extension).
In order to use Azure OpenAI models, you need to have an Azure OpenAI API key. You can get one here. Once you have an API key, you can use it to instantiate an Azure OpenAI object:
Configure Azure OpenAI:
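A sketch assuming the extension's AzureOpenAI class accepts the endpoint, key, and deployment name; the parameter names mirror earlier releases and may differ:

```python
import pandasai as pai
from pandasai_openai import AzureOpenAI

llm = AzureOpenAI(
    api_base="https://<your-resource>.openai.azure.com/",  # your Azure endpoint
    api_key="your-azure-openai-api-key",
    deployment_name="gpt-4o",  # the name of your model deployment
)
pai.config.set({"llm": llm})
```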
Google models
Install the pandasai-google extension with `pip install pandasai-google`.
Google Gemini
In order to use Google Gemini models, you need to have a Google API key. You can get one here. Once you have an API key, you can use it to instantiate a Google Gemini object:
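A minimal sketch; the GoogleGemini class name is an assumption about the Google extension's API:

```python
import pandasai as pai
from pandasai_google import GoogleGemini  # assumption: class name in the Google extension

llm = GoogleGemini(api_key="your-google-api-key")
pai.config.set({"llm": llm})
```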
Google VertexAI
In order to use Google models through the Vertex AI API, you need:
- A Google Cloud project
- The region of the project set up
- The optional dependency google-cloud-aiplatform installed
- gcloud authentication configured
Once the basic setup is done, you can instantiate a Google model through Vertex AI:
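A sketch with placeholder project details; the GoogleVertexAI class name and its parameters mirror the v2 wrapper and are assumptions here:

```python
import pandasai as pai
from pandasai_google import GoogleVertexAI  # assumption: class name in the Google extension

llm = GoogleVertexAI(
    project_id="your-gcp-project-id",
    location="us-central1",  # the region your project is set up in
    model="gemini-1.5-pro",  # hypothetical model name; use one enabled in your project
)
pai.config.set({"llm": llm})
```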
HuggingFace models
In order to use HuggingFace models via text-generation, you need to first serve a supported large language model (LLM). Read the text-generation docs for more on how to set up an inference server. This can be used, for example, to run models such as LLaMa2 and CodeLLaMa. You can find more information about text-generation here.
Install the pandasai-huggingface extension with `pip install pandasai-huggingface`.
The inference_server_url is the only required parameter to instantiate a HuggingFaceTextGen model:
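A minimal sketch; the extension's import path is an assumption, and the URL points at a locally running text-generation inference server:

```python
import pandasai as pai
from pandasai_huggingface import HuggingFaceTextGen  # assumption: extension import path

llm = HuggingFaceTextGen(
    inference_server_url="http://127.0.0.1:8080"  # your text-generation server
)
pai.config.set({"llm": llm})
```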
LangChain models
Install the pandasai-langchain extension with `pip install pandasai-langchain`.
Configure LangChain:
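A sketch assuming the extension exposes a LangchainLLM wrapper (as in earlier releases) that adapts any LangChain LLM; the langchain_openai import is only an example backing model:

```python
import pandasai as pai
from pandasai_langchain import LangchainLLM  # assumption: wrapper class name
from langchain_openai import OpenAI as LangchainOpenAI

# Wrap a LangChain LLM so PandaAI can drive it
langchain_llm = LangchainOpenAI(openai_api_key="your-openai-api-key")
llm = LangchainLLM(langchain_llm)
pai.config.set({"llm": llm})
```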
Amazon Bedrock models
In order to use Amazon Bedrock models, you need AWS credentials (an access key ID and secret access key) and access to the model in Bedrock.
Install the pandasai-bedrock extension with `pip install pandasai-bedrock`.
Configure AWS Bedrock:
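A sketch assuming the extension takes a boto3 Bedrock runtime client, as the v2 wrapper did; the BedrockClaude class name is an assumption:

```python
import boto3
import pandasai as pai
from pandasai_bedrock import BedrockClaude  # assumption: class name, mirroring the v2 wrapper

# boto3 reads AWS credentials from the environment or ~/.aws
bedrock_runtime_client = boto3.client("bedrock-runtime", region_name="us-east-1")

llm = BedrockClaude(bedrock_runtime_client)
pai.config.set({"llm": llm})
```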
IBM models
In order to use IBM watsonx.ai models, you need:
- An IBM Cloud API key
- A Watson Studio project in IBM Cloud
- The service URL associated with the project's region
The API key can be created in IBM Cloud. The project ID can be determined after a Watson Studio service is provisioned in IBM Cloud; the ID can then be found in the project's Manage tab (Project -> Manage -> General -> Details). The service URL depends on the region of the provisioned service instance and can be found here.
Install the pandasai-ibm extension with `pip install pandasai-ibm`.
Configure IBM Watson:
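A sketch with placeholder credentials; the IBMwatsonx class name and parameters mirror the v2 wrapper and are assumptions here:

```python
import pandasai as pai
from pandasai_ibm import IBMwatsonx  # assumption: class name, mirroring the v2 wrapper

llm = IBMwatsonx(
    model="ibm/granite-13b-chat-v2",                  # example model
    api_key="your-ibm-cloud-api-key",
    watsonx_url="https://us-south.ml.cloud.ibm.com",  # depends on your region
    watsonx_project_id="your-project-id",
)
pai.config.set({"llm": llm})
```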
Local models
Install the pandasai-local extension with `pip install pandasai-local`.
Ollama
Ollama’s compatibility is experimental (see docs). With an Ollama server running, you can instantiate an LLM object by specifying the model name:
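A minimal sketch; the LocalLLM class name is an assumption about the local extension, and codellama is just an example model you have pulled in Ollama:

```python
import pandasai as pai
from pandasai_local import LocalLLM  # assumption: class name in the local extension

# Ollama serves an OpenAI-compatible API on port 11434 by default
llm = LocalLLM(api_base="http://localhost:11434/v1", model="codellama")
pai.config.set({"llm": llm})
```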
LM Studio
An LM Studio server only hosts one model, so you can instantiate an LLM object without specifying the model name:
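The same assumed LocalLLM class can point at LM Studio; since the server hosts a single model, no model name is passed:

```python
import pandasai as pai
from pandasai_local import LocalLLM  # assumption: class name in the local extension

# LM Studio's local server listens on port 1234 by default
llm = LocalLLM(api_base="http://localhost:1234/v1")
pai.config.set({"llm": llm})
```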
Determinism
Determinism in language models refers to the ability to produce the same output consistently given the same input under identical conditions. This characteristic is vital for:
- Reproducibility: Ensuring the same results can be obtained across different runs, which is crucial for debugging and iterative development.
- Consistency: Maintaining uniformity in responses, particularly important in scenarios like automated customer support, where varied responses to the same query might be undesirable.
- Testing: Facilitating the evaluation and comparison of models or algorithms by providing a stable ground for testing.
The Role of temperature=0
The temperature parameter in language models controls the randomness of the output. A higher temperature increases diversity and creativity in responses, while a lower temperature makes the model more predictable and conservative. Setting temperature=0 essentially turns off randomness, leading the model to choose the most likely next word at each step. This is critical for achieving determinism as it minimizes variance in the model's output.
Implications of temperature=0
- Predictable Responses: The model will consistently choose the most probable path, leading to high predictability in outputs.
- Creativity: The trade-off for predictability is reduced creativity and variation in responses, as the model won’t explore less likely options.
Utilizing seed for Enhanced Control
The seed parameter is another tool to enhance determinism. It sets the initial state for the random number generator used in the model, ensuring that the same sequence of "random" numbers is used for each run. This parameter, when combined with temperature=0, offers an even higher degree of predictability.
Example:
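A sketch using the OpenAI extension; the API key, seed value, and data are placeholders, and pai.DataFrame is used for illustration:

```python
import pandasai as pai
from pandasai_openai import OpenAI

# temperature=0 removes sampling randomness; a fixed seed pins down the rest
llm = OpenAI(
    api_token="your-openai-api-key",
    temperature=0,
    seed=26,
)
pai.config.set({"llm": llm})

df = pai.DataFrame({"country": ["Spain", "France"], "gdp": [1.4, 2.9]})
df.chat("Which country has the highest gdp?")  # repeated runs should return the same answer
```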
Current Limitation:
AzureOpenAI Instance
While the seed parameter is effective with the OpenAI instance in our library, it's important to note that this functionality is not yet available for AzureOpenAI. Users working with AzureOpenAI can still use temperature=0 to reduce randomness but without the added predictability that seed offers.
System fingerprint
As mentioned in the documentation (OpenAI Seed):
Sometimes, determinism may be impacted due to necessary changes OpenAI makes to model configurations on our end. To help you keep track of these changes, we expose the system_fingerprint field. If this value is different, you may see different outputs due to changes we’ve made on our systems.
Workarounds and Future Updates
For AzureOpenAI Users: Rely on temperature=0 for reducing randomness. Stay tuned for future updates as we work towards integrating seed functionality with AzureOpenAI.
For OpenAI Users: Utilize both temperature=0 and seed for maximum determinism.