Highlights:
- Through the fine-tuning API, customers can supply OpenAI's language models with supplementary data not included in their pre-existing knowledge base.
- A software team could, for instance, assess how a model responds to a question after the fifth epoch, then pose the same question to the version saved two epochs later to determine whether the accuracy of its responses has improved.
OpenAI recently introduced a suite of new tools to streamline the optimization of its large language models for specific tasks.
These optimization tools for GPT-4 are being rolled out for the company’s fine-tuning application programming interface (API), which was initially launched last August. Through the API, customers can supply OpenAI's language models with supplementary data not included in their pre-existing knowledge base. For instance, a retailer could feed product details into GPT-4 and use the model to answer customers’ questions about those products.
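As a loose illustration of how this works in practice, the sketch below uses the official openai Python SDK to upload a training file and start a fine-tuning job. The file name, training data, and base model identifier are placeholders, not values from OpenAI's announcement.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload a JSONL file of chat-formatted training examples
# (e.g., product details paired with ideal answers).
training_file = client.files.create(
    file=open("product_qa.jsonl", "rb"),  # hypothetical file name
    purpose="fine-tune",
)

# Start a fine-tuning job on top of a base model. The model name below is a
# placeholder; use whichever model your account is enabled to fine-tune.
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",
)
print(job.id, job.status)
```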
Fine-tuning an LLM on external data can be a complex process fraught with potential for technical errors. If something goes wrong, the LLM may struggle to process the provided information accurately, which limits its usefulness. The first of the new fine-tuning API enhancements is designed to tackle this challenge.
An AI fine-tuning project is typically segmented into phases referred to as epochs. During each epoch, the model processes the dataset it’s being fine-tuned on once. Malfunctions commonly arise not during the initial epoch of a fine-tuning project but rather in the epochs that follow.
With OpenAI’s improved fine-tuning API, developers can now preserve a version of an AI model after each epoch of a training session. Thanks to this capability, should a malfunction arise during the fifth epoch, the project can resume from the checkpoint saved after the fourth epoch rather than restarting the entire process from the beginning, reducing the time and effort required to address fine-tuning errors.
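A minimal sketch of how those per-epoch snapshots can be retrieved, assuming the checkpoints endpoint exposed by the current openai Python SDK and a hypothetical job ID:

```python
from openai import OpenAI

client = OpenAI()

# "ftjob-abc123" is a placeholder fine-tuning job ID.
checkpoints = client.fine_tuning.jobs.checkpoints.list(
    fine_tuning_job_id="ftjob-abc123"
)

for cp in checkpoints.data:
    # Each checkpoint is a usable model snapshot, so a run that degrades at a
    # later epoch can fall back to an earlier one instead of starting over.
    print(cp.step_number, cp.fine_tuned_model_checkpoint)
```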
The rollout also includes a new section of the Playground user interface that, according to OpenAI, lets developers compare different versions of a fine-tuned model. A software team could, for instance, assess how a model responds to a question after the fifth epoch, then pose the same question to the version saved two epochs later to determine whether the accuracy of its responses has improved.
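Outside the Playground, the same comparison amounts to sending one prompt to two checkpoint models. The checkpoint names and the question below are hypothetical.

```python
from openai import OpenAI

client = OpenAI()

# Hypothetical checkpoint model names, e.g., taken from the checkpoint list above.
epoch_5_model = "ft:gpt-3.5-turbo:acme::ckpt-step-500"
epoch_7_model = "ft:gpt-3.5-turbo:acme::ckpt-step-700"

question = "What is the warranty period on the X200 blender?"

for model in (epoch_5_model, epoch_7_model):
    reply = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": question}],
    )
    print(f"{model}: {reply.choices[0].message.content}")
```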
OpenAI is also upgrading its AI fine-tuning dashboard. According to the company, developers can now customize models’ hyperparameters more easily. These training settings influence the accuracy of the responses an LLM produces.
The revamped dashboard also offers access to more comprehensive technical data about fine-tuning sessions. As an additional feature of the optimization tools for GPT-4, OpenAI has included the capability to stream that data to third-party AI development tools. The initial integration is being rolled out for Weights & Biases, a model development platform built by a well-funded startup of the same name.
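As a rough sketch of both features, assuming the hyperparameters and integrations options of the fine-tuning job endpoint in the openai Python SDK, the call below sets custom training values and asks for metrics to be streamed to a Weights & Biases project. The values and project name are placeholders, not recommendations.

```python
from openai import OpenAI

client = OpenAI()

job = client.fine_tuning.jobs.create(
    training_file="file-abc123",  # placeholder ID of an uploaded training file
    model="gpt-3.5-turbo",        # placeholder base model
    hyperparameters={
        "n_epochs": 4,                    # passes over the training dataset
        "batch_size": 8,                  # examples per training step
        "learning_rate_multiplier": 0.5,  # scales the default learning rate
    },
    # Stream training metrics to a Weights & Biases project; the W&B account
    # must already be linked in the OpenAI dashboard for this to take effect.
    integrations=[
        {"type": "wandb", "wandb": {"project": "gpt-fine-tuning"}},
    ],
)
print(job.id, job.status)
```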
OpenAI has introduced a new offering named assisted fine-tuning, catering to enterprises seeking more advanced model optimization features. It can enhance a model’s capabilities through the use of additional hyperparameters. Clients can also have their LLMs optimized using a technique known as parameter-efficient fine-tuning, or PEFT, which fine-tunes specific sections of a model rather than the entire model.
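OpenAI has not published the details of its PEFT approach, but the general idea can be illustrated with the open-source Hugging Face peft library, which trains small LoRA adapter weights while leaving the rest of the model frozen. The sketch below is only a conceptual stand-in, not OpenAI's method.

```python
# Conceptual PEFT illustration: attach LoRA adapters to a small open model so
# that only a tiny fraction of parameters is trained. Not OpenAI's implementation.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("gpt2")  # small stand-in model

config = LoraConfig(
    r=8,                        # rank of the low-rank adapter matrices
    lora_alpha=16,              # scaling factor for the adapter updates
    target_modules=["c_attn"],  # GPT-2 attention projection layers
    lora_dropout=0.05,
)

model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```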