Highlights:
- Stability AI intends to develop a series of language models, of which StableLM is the first. Future installments in the series are expected to feature more complex architectures.
- The new StableLM model from Stability AI can generate text and code, a set of tasks comparable to those performed by much larger models.
StableLM, an open-source language model that can generate text and code, was recently released by Stability AI Ltd., an artificial intelligence startup.
The startup intends to develop a series of language models, of which StableLM is the first. Future additions to the series are expected to feature more complex architectures.
Stability AI, based in London, is supported by USD 101 million in funding. It is best known as the creator of the open-source neural network Stable Diffusion, which can generate images based on text input. A few days before the latest introduction of the StableLM language model, the startup released a significant update to Stable Diffusion.
StableLM is initially available in two versions. The first comprises three billion parameters, the configuration settings that determine how a neural network processes data. The second version contains seven billion of those settings.
The more parameters a neural network has, the more tasks it can complete. PaLM, a large language model that Google LLC detailed last year, comprises over 500 billion parameters. It has demonstrated the ability to generate code and text and solve relatively complex mathematical problems.
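To give a rough sense of what these parameter counts mean in practice, the sketch below estimates the memory needed just to store a model's weights. It is a back-of-the-envelope illustration, assuming 16-bit (2-byte) weights; actual requirements vary with numerical precision and runtime overhead.

```python
# Back-of-the-envelope memory estimate for holding model weights in memory.
# Assumes each parameter is stored as a 16-bit (2-byte) float; real
# deployments differ (fp32, int8/int4 quantization, activation overhead).

def weight_memory_gb(num_params: int, bytes_per_param: int = 2) -> float:
    """Approximate memory needed to hold the weights alone, in gigabytes."""
    return num_params * bytes_per_param / 1e9

for name, params in [("3B model", 3_000_000_000),
                     ("7B model", 7_000_000_000),
                     ("540B model", 540_000_000_000)]:
    print(f"{name}: ~{weight_memory_gb(params):.0f} GB at 16-bit precision")
```

The gap between a 7-billion-parameter model and a 500-billion-plus model is therefore not just capability but also the hardware required to run it.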
The new StableLM model from Stability AI can perform comparable operations. However, the startup has yet to disclose specific information regarding the model’s capabilities. Stability AI intends to publish a technical overview of StableLM later on.
While the startup did not reveal specific information about StableLM’s capabilities, it did detail how the model was trained. Stability AI created it using an enhanced version of The Pile, an open-source training dataset. The enhanced dataset contains 1.5 trillion tokens, data elements that each consist of a few characters.
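To illustrate what a token is, the toy sketch below splits text into small subword units by greedily matching against a tiny hand-written vocabulary. This is only an illustration of the concept; StableLM uses a learned subword vocabulary, not this scheme or word list.

```python
# Toy illustration of tokenization: breaking text into small subword units.
# The vocabulary and greedy longest-match scheme are hypothetical; real
# language models learn their subword vocabularies from data.

TOY_VOCAB = {"open", "source", "lang", "uage", "model", "s", " ", "-"}

def toy_tokenize(text: str) -> list[str]:
    """Greedily match the longest vocabulary entry at each position."""
    tokens = []
    i = 0
    while i < len(text):
        for j in range(len(text), i, -1):  # try the longest substring first
            if text[i:j] in TOY_VOCAB:
                tokens.append(text[i:j])
                i = j
                break
        else:
            tokens.append(text[i])  # unknown character becomes its own token
            i += 1
    return tokens

print(toy_tokenize("open-source language models"))
```

A 1.5-trillion-token dataset is simply a corpus that, after this kind of splitting, yields 1.5 trillion such units.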
StableLM is licensed under the CC BY-SA 4.0 open-source license. The model can be used in research and commercial endeavors, and its code can be modified as needed.
Stability AI stated in a blog post, “We open-source our models to promote transparency and foster trust. Researchers can ‘look under the hood’ to verify performance, work on interpretability techniques, identify potential risks, and help develop safeguards. Organizations across the public and private sectors can adapt (‘fine-tune’) these open-source models for their own applications.”
Stability AI also released five StableLM variants trained on datasets beyond The Pile. Training an artificial intelligence model on additional data enables it to incorporate more information into its responses and perform new tasks. The five specialized variants of StableLM may be restricted to academic research use.
Among the datasets Stability AI used to train the specialized variants of StableLM was Dolly, a collection of 15,000 chatbot queries and replies that Databricks Inc. released earlier this month. Databricks used the dataset to train an advanced language model of its own, which, like StableLM, is available under an open-source license.
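The sketch below shows the general shape of one such query-and-reply record and how a fine-tuning pipeline might turn it into a training example. The field names follow the instruction/context/response pattern common to instruction-tuning datasets; treat them and the prompt template as illustrative assumptions, not Dolly's exact schema.

```python
import json

# Illustrative shape of an instruction-tuning record such as those in Dolly.
# Field names and prompt template are assumptions modeled on common
# instruction-tuning conventions, not a documented Dolly schema.

record = json.loads("""
{
  "instruction": "What is an open-source language model?",
  "context": "",
  "response": "A language model whose weights are publicly released.",
  "category": "open_qa"
}
""")

# A fine-tuning pipeline would render each record into a prompt/target pair:
prompt = f"### Instruction:\n{record['instruction']}\n\n### Response:\n"
target = record["response"]
print(prompt + target)
```

Fine-tuning on thousands of such pairs teaches a base model to answer questions in a conversational style, which is the role these specialized StableLM variants play.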
StableLM is in the alpha phase and is the first language model Stability AI has released. As part of its development roadmap, the startup plans to create StableLM variants with 15 billion to 65 billion parameters.