Highlights:
- A new “Asset Orchestrator” at the heart of the release will enable developers to fine-tune their models using assets like Low-Rank Adaptations, or LoRAs.
- The new service, OctoML claims, can generate images in an average of 2.8 seconds, significantly speeding up image production with the photo-realistic model Stable Diffusion XL.
Recently, OctoML Inc., a startup specializing in artificial intelligence optimization, announced the release of OctoAI Image Gen. This architecture enables developers to customize image generation on well-known models like Stable Diffusion and simultaneously apply changes to thousands of assets.
Luis Ceze, Chief Executive of OctoML, said, “Image generation applications have quickly gone from fad to real business, with many e-commerce, entertainment and creative organizations looking to differentiate their service with AI. But building these custom experiences with Stable Diffusion today is an extensive engineering effort that simply doesn’t scale.”
In June, OctoAI was introduced to assist developers in creating and scaling their artificial intelligence models. With this new release, the platform now provides an API endpoint for image generation and supports fine-tuning at scale.
The new “Asset Orchestrator,” the centerpiece of the release, will enable developers to enhance their models with assets such as Low-Rank Adaptations, or LoRAs. A LoRA is a fine-tuning asset: using one, users can quickly train Stable Diffusion on a particular concept, such as a specific character or style.
Unlike full image-generation models, which can be cumbersome due to their size, LoRA training produces small, portable weight files. Because they require far less processing power, LoRAs are also much faster and simpler to train.
Once trained, a LoRA can be applied to a Stable Diffusion model to produce images featuring that particular character or style. For fine-tuning, LoRAs therefore represent a reasonable trade-off in terms of size, training time, and computing power.
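The size advantage comes from the low-rank math itself: rather than updating a full weight matrix, LoRA training learns two small low-rank factors whose product is added to the frozen weights, so only those factors need to be stored and shipped. A minimal NumPy sketch of the idea (the dimensions, rank, and scaling factor here are illustrative assumptions, not OctoML's implementation):

```python
import numpy as np

d, r = 1024, 8          # layer width and LoRA rank (r << d)
rng = np.random.default_rng(0)

W = rng.standard_normal((d, d))          # frozen pretrained weight matrix
A = rng.standard_normal((r, d)) * 0.01   # trainable low-rank factor
B = np.zeros((d, r))                     # zero-initialized, learned in training

alpha = 16                               # LoRA scaling hyperparameter
W_adapted = W + (alpha / r) * (B @ A)    # effective weight at inference time

full_params = W.size                     # parameters in the full matrix
lora_params = A.size + B.size            # parameters the LoRA actually stores
print(full_params, lora_params)          # 1048576 16384
```

Here the LoRA stores 64 times fewer parameters than the full matrix, which is why the resulting assets are small enough to swap in and out per request.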
Users can prompt a Stable Diffusion model with text to create an image of, say, a video game or comic book character. This typically yields finicky, inconsistent results and requires many rounds of rewording to get the model to manifest the intended image, a practice known as prompt engineering.
A LoRA trained on pictures of that particular character, or on a desired style such as the art of a certain video game or a certain era of comic books, would align the model far more closely with the intended results. Much less prompt engineering would be needed for the model to produce a reasonably good customized image.
Using the photo-realistic model Stable Diffusion XL, the new service generates images in an average of 2.8 seconds, according to OctoML. As part of the asset management feature, users can manage and pull models and data from popular sources such as CivitAI, a community platform where users share Stable Diffusion models and AI artwork, and Hugging Face Inc.’s open-source AI model repository.
Numerous clients have already implemented the OctoAI image generation solution in their business applications, including Storytime AI, which makes an app that uses AI to create children’s stories, and NightCafe Studio Pty Ltd., which operates an AI art generator website and community.
Brian Carlson, CEO of Storytime AI, said, “Our top priority is to deliver kid-safe, consistent, engaging images for our custom children’s stories. Previously, this process relied on heavy-handed prompt engineering. But OctoAI helped us stand up a whole new image gen architecture utilizing assets like LoRAs to create consistent visuals without the added complexity of prompt engineering.”