Lore Issue #50: OpenAI Launches Voice and Multi-Modal Features

Nathan Lands
September 27, 2023

Good morning.

This is the most exciting week we’ve had in AI since the initial launch of ChatGPT. Just in time for our 50th issue!

Key Updates:

OpenAI is rolling out voice and image features for Plus users. ChatGPT can now describe what is in images it was never specifically trained on, which feels like a dramatic step toward AGI.
Spotify unveils AI-powered voice translation for podcasts.
Amazon agrees to invest up to $4B in Anthropic, an OpenAI competitor.

The vibe has changed. As the e/acc crowd says, it’s time to accelerate.

— Nathan Lands

Corti raised $60M for a medical AI “copilot” (Link)
Kneron raised $49M to develop AI chips to power self-driving cars and other autonomous machines (Link)
Legit Security raised $40M to help businesses protect their AI applications from malicious attacks and security breaches (Link)
KYP.ai raised €17.5M for its data-driven insights platform for process automation (Link)
AI data startup Secoda raised $14M (Link)
DynamoFL raised $15M for privacy-focused AI solutions (Link)
Raven AI raised $12M to provide data contextualization software for frontline operations (Link)
Genus AI raised $11M for its generative AI platform for D2C and e-commerce brands (Link)
Qbiq raised $10M for its generative AI and real estate space planning platform (Link)
Paxton AI raised $6M for its legal AI chatbot (Link)
Catalyte raised $3.2M to develop AI-for-workforce solutions (Link)

Coming up:

AI Mapit is looking to raise funding to digitize real-world infrastructure with AI and geotagging (Link)

Harness the Power of Generative AI with Alpha

Navigating the new era of AI? Alpha is here to guide your journey. As a leading product design agency, Alpha specializes in unlocking generative AI's potential, transforming it into actionable solutions for your business.

Here's what Alpha brings to the table:

Interface Design: Alpha merges aesthetics with functionality, crafting user-centric interfaces that make AI intuitive.
Model Selection: Alpha chooses the ideal AI models for your needs, optimizing performance and efficiency.
Prompt Engineering: They're masters at designing prompts that produce precise, desired outputs.
Fine Tuning: Alpha ensures AI models are perfectly trained and tuned for new domains and custom use-cases.
Configuring API Usage: With Alpha, frequent changes in AI models don't disrupt your product operations.
Data Protection: Alpha prioritizes your privacy, securing personal and sensitive data.
With Alpha's three-step process - understand, build, unleash - you'll not only get innovative AI products, but also continual optimization to improve results over time.
Due to high demand, they’ve got a waitlist. Your future with AI, empowered by Alpha, starts here. Join the waitlist today!

ChatGPT can now see, hear, and speak

OpenAI is rolling out new voice and image capabilities in ChatGPT. These let you have a voice conversation with ChatGPT or show it what you’re talking about.

Amazon invests $4bn in Anthropic

Amazon has agreed to invest up to $4 billion in Anthropic, as part of a broader collaboration to develop reliable and high-performing foundation models.

Getty Images launches an AI-powered image generator

Getty Images has released an AI image generator powered by an Nvidia model trained on its own licensed images.

Customers who create visuals using the tool will receive Getty’s standard royalty-free license, which includes protection against copyright lawsuits and the right to “perpetual, worldwide, nonexclusive” use across all media.

Other news

Huge updates to Microsoft Copilot; Personalized AI experiences from Windows (Link)
YouTube releases Dream Screen + other new AI tools (Link)

Spotify’s instant voice translation for podcasts

This is me speaking Spanish, thanks to amazing work by @Spotify AI engineers. The translation & voice-cloning are fully done by AI. Language can create barriers of understanding & thus fuel division. I can't wait for AI to break down this barrier & reveal our common humanity ❤… twitter.com/i/web/status/1…
— Lex Fridman (@lexfridman)
5:40 PM • Sep 25, 2023

Encrypting text in videos

Encrypt your message in a video. Available Now.
Credit:@MatanCohenGrumi
— Pika (@pika_labs)
11:37 AM • Sep 24, 2023

Illusion Diffusion on HuggingFace

Illusion Diffusion is the #1 trending Space on HuggingFace!
Thank you to everyone who contributed and are playing with it 🥳
— AP (@angrypenguinPNG)
5:27 PM • Sep 23, 2023

Simulating game actions in realtime with GPT-4

LFG!!! 🫡 finally finished devlog #0
all actions & dialogue for every character is fully simulated, in real-time, with @OpenAI's GPT-4
and I condensed all the work so far into a 2-minute explainer, just for you 😉
— Harris Rothaermel (@DeveloperHarris)
4:48 PM • Sep 24, 2023

Wonder Studio: An AI tool that automatically animates, lights and composes CG characters into a live-action scene (Link)
Pictory: Produce a month of video and social content from long-form content in minutes (Link)
QR Craft: Create artistic QR codes (Link)
Truewind: AI-powered Bookkeeping and Financial Modeling for Startups (Link)
BuildBox: Create video games with AI (Link)

Find more tools at FutureTools.io.

Multimodal Foundation Models: From Specialists to General-Purpose Assistants (Link)
A Large-scale Dataset for Audio-Language Representation Learning (Link)
Small-scale proxies for large-scale Transformer training instabilities (Link)
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention (Link)

That's it!

See you next week!

-Nathan Lands

If you want to support Lore, please:

Share this newsletter
Follow us on Twitter @NathanLands & LoreAiNews.
Join our Generative AI community on LinkedIn.