Datasets
Uploading Data
Upload external data to kickstart your fine-tuning process. Use the OpenAI chat fine-tuning format.
Upload a JSONL file populated with a list of training examples.
Each line of the file should be compatible with the OpenAI chat format, with additional optional fields.
OpenAI Fields
messages
: Required - Formatted as a list of OpenAI chat completion messages. The list should end with an assistant message.tools
: Optional - An array of tools (functions) available for the model to call. For more information read OpenAI’s function calling docs.tool_choice
: Optional - You can set this to indicate that the model should be required to call the given tool. For more information read OpenAI’s function calling docs.
Deprecated
functions
: Deprecated | Optional - An array of functions available for the model to call.function_call
: Deprecated | Optional - You can set this to indicate that the model should be required to call the given function.
You can include other parameters from the OpenAI chat completion input format (eg. temperature), but they will be ignored since they aren’t relevant for training.
Additional Fields
split
: Optional - One of “TRAIN” or “TEST”. If you don’t set this field we’ll automatically divide your inputs into train and test splits with a target ratio of 90:10.rejected_message
: Optional - Add a rejected output for entries on which you want to perform direct preference optimization (DPO). You can find more information about that here: Direct Preference Optimizationmetadata
: Optional - A string=>string dictionary of any additional information you want to associate with an entry. This can be useful for tracking information like prompt IDs.