- The Intelligent Worker
- Posts
- 🎤 OpenAI's biggest update
🎤 OpenAI's biggest update
👨💻 OpenAI ships real-time API and vision FT + ✈️ Microsoft Copilot gets an overhaul
Hi everyone,
OpenAI DevDay was yesterday, where they unveiled a host of amazing developer tools. More details below, but imagine using ChatGPT’s real-time voice assistant in your own apps.
Also, Microsoft Copilot gets voice and other upgrades.
Let’s get right into it.
As always, please support our partner Writer.
In this issue:
🤝In Partnership: Simplify AI app building with Writer
🤿Deep Dive: OpenAI's Developer Day unveils new tools
🖼️AI Art: Examples of great and trending AI art
🤿Deep Dive: New features of Microsoft Copilot
⚒️Tool Snapshots: Tools for AI, no-code, and productivity
📰Top News: News on AI, no-code, and productivity
🤝IN PARTNERSHIP WITH WRITER
The fastest way to build AI apps
Writer Framework: build Python apps with drag-and-drop UI
API and SDKs to integrate into your codebase
Intuitive no-code tools for business users
🤿 DEEP DIVE
OpenAI's Developer Day Reveals New Tools Amidst Executive Changes and Competitive Challenges
Intelligence: Amidst a turbulent week characterized by executive departures and fundraising developments, OpenAI unveiled several new tools at its 2024 DevDay, including a public beta of its “Realtime API” for low-latency, AI-generated voice responses.
Source:OpenAI
The new Realtime API enables developers to create nearly real-time, speech-to-speech experiences in apps with six voices provided by OpenAI.
OpenAI also introduced vision fine-tuning in its API, allowing developers to use images and text to adjust applications of GPT-4o.
The company has launched a model distillation feature, enabling developers to use larger AI models to fine-tune smaller ones, potentially improving performance and reducing costs.
Significance: OpenAI ships more features as more executives leave the firm. I’m also very happy that they are giving developers all these features through an API, and not keeping it to themselves.
🖼️ AI ART
Examples of great and trending AI art
"Who Will Dominate the Lord of the Rings Wrestling Arena?" by Sudden_Relative_5439, created with Midjourney
🤿 DEEP DIVE
Microsoft Copilot's New Features and Privacy Concerns
Intelligence: Microsoft is launching new capabilities to its AI-powered product, Copilot, including a tool that can understand and respond to what's on users' screens, despite ongoing privacy concerns.
The new Copilot Vision can analyze text and images on web pages and answer queries about them. It is currently limited to interpreting specific types of websites, excluding paywalled and "sensitive" content.
Microsoft insists that Copilot Vision was designed to delete data immediately following conversations and that it won't store or use processed audio, images, or text to train models.
The Think Deeper feature allows Copilot to reason through more complex problems using "reasoning models" from OpenAI, fine-tuned by Microsoft.
Copilot Voice allows users to talk to Copilot and have its responses spoken aloud.
Significance: This demonstrates Microsoft's push to improve the functionality and user experience of AI tools. This also shows that a careful balance between innovation and privacy continues to be a critical issue in the AI industry.
⚒️ TOOL SNAPSHOTS
Futuristic tools within AI, no-code, and productivity
⚡ Thunderbit - Your 1-click AI web task automation tool. Free option available.
📋 Hoop - AI-powered task consolidation from meetings to Slack channels. Free to try.
🤖 Future AGI - Streamline AI model QA with custom Critique Agents. Free to try.
💼 interview.co - Streamlines interviewing for superior efficiency and enjoyment. Free to try.
📊Hubmee - Your comprehensive personal and household manager. Payment required.
🗣️ Wispr Flow - Triple speed, natural-style dictation app for Mac. Payment required.
📰 TOP NEWS
News on AI, no-code, automation, and productivity
Raspberry Pi launched an AI camera for $70, which is designed to work with all Pi models, features Sony’s IMX500 image sensor, and integrates an AI accelerator that can run varied neural network models with low power consumption and latency, thus freeing up the device's processor for other tasks.
Pinterest has launched generative AI features in its Pinterest Performance+ suite, enabling advertisers to transform the backgrounds of product ads into lifestyle imagery, which statistically increases clickthrough rates, decreases cost-per-click, and significantly boosts conversion rates as per the successful early adoption from Walgreens.
In a discreet part of a recent overhaul update, Microsoft revealed that it’s integrating its chatbot, Copilot, into WhatsApp, and it's proving to be an interesting (and a bit Dutch) but enjoyable endeavour.
Google's new AI podcast tool, NotebookLM, turns text into impressively lifelike audio, and as seen in its outstanding early application in creating stunningly real podcast discussions, it significantly reduces barriers to content production, however, the implications of misused or error-prone AI in communication are terrifying.
Microsoft’s finance leases, primarily for data center construction to handle the heavy load of AI workloads, have surged to more than $100 billion, signaling their aggressive capital investments toward AI advancements while setting the stage for a potential impact on profitability.
ℹ️ ABOUT US
The Intelligent Worker helps you to be more productive at work with AI, automation, no-code, and other technologies.
We like real, practical, and tangible use-cases and hate hand-wavy, theoretical, and abstract concepts that don’t drive real-world outcomes.
Our mission is to empower individuals, boost their productivity, and future-proof their careers.
We read all your comments - please provide your feedback!
Did you like today's email?Your feedback is more valuable to us than coffee on a Monday morning! |
What more do you want to see in this newsletter?Please vote |