Verses Over Variables

Your guide to the most intriguing developments in AI

Welcome to Verses Over Variables, a newsletter exploring the world of artificial intelligence (AI) and its influence on our society, culture, and perception of reality.

AI Hype Cycle

OpenAI’s Policy Agenda: Regulate or Renegade?

Last week, we dissected Anthropic's thoughtful proposals for AI governance, a plan that felt like a well-reasoned appeal for AI safety and responsible innovation. This week, it's OpenAI's turn at the policy podium. The company has released its own set of recommendations for the U.S. AI Action Plan, submitted to the White House Office of Science and Technology Policy (OSTP). Buckle up, because their approach is less of a gentle nudge and more of a full-on policy proposal.

If Anthropic's proposal was a carefully curated playlist of AI safety hits, OpenAI's feels more like a bold (perhaps even slightly chaotic) remix. The core message is to keep the innovation flowing, but also put some guardrails up for those AI models that start looking a little too powerful. (Think less "helpful assistant" and more "Skynet in beta.")

One of the most striking proposals is a new licensing regime for the big companies of AI — mandatory registration and licensing for AI models that cross a certain capability threshold. The idea is to create a system of oversight and accountability for the most advanced systems. Think of it like getting a driver's license for your AI – you need to prove you know how to handle that much power before you unleash it on the open road.

OpenAI also pushes for independent audits to assess the safety, security, and societal impact of these advanced AI systems. Because let's face it, nobody wants a repeat of those early chatbot days when our AI friends were spitting out everything from conspiracy theories to unintentionally (or intentionally) offensive content. Regular check-ups and safety standards sound like a better plan.

OpenAI is also suggesting a federal framework that would essentially override individual state laws on AI. Now, on one hand, this could streamline things and create a more uniform approach to AI governance. On the other hand, it raises questions about federal overreach and the potential for stifling innovation at the local level.

And it's not just domestic policy OpenAI's concerned with. They've also directly addressed the rise of Chinese AI, specifically calling out the Chinese AI lab DeepSeek as a potential security risk. They've even gone so far as to suggest restrictions, potentially even a ban, on AI models developed by organizations affiliated with the People's Republic of China in allied countries, arguing that concerns about data access, state control, and the potential for malicious use warrant such measures. It's a move that could significantly reshape the global AI landscape, turning the competition into an outright cold war.

OpenAI also touches on export controls (keeping sensitive AI tech out of the wrong hands), infrastructure investment (because you can't build the future on dial-up), and intellectual property (AI and copyright – a never-ending saga). They even suggest modernizing government operations with AI tools. (Imagine an AI-powered DMV).

OpenAI seems to be walking a tightrope, trying to balance the need for responsible AI development with a desire to maintain America's competitive edge. It's a complex dance, and it's not clear whether they'll stick the landing. What is clear is that the AI policy conversation is heating up.

Back to Basics

Beyond ChatGPT: Learning the Power of OpenAI’s APIs

We're not engineers, and we definitely don't want to get lost in the technical weeds, but this week the tech world was buzzing about AI infrastructure. Apparently, OpenAI and Anthropic are in a bit of a race to set the standard for how AI models interact with other digital services, and the latest contender to enter the arena came in the form of a new API from OpenAI.

If you're just starting to explore the world of AI, you might be wondering, "What's an API?" Think of it as a digital bridge, connecting different software applications and allowing them to share information and capabilities. In our case, it's how your projects can tap into the AI models developed by OpenAI, Anthropic and others.

Let's take a look at all three of OpenAIs API offerings – the Chat Completions API, the Assistants API, and the brand-new Responses API – to see what each brings to the table and how they can inspire your next creative project.

  1. Chat Completions API - This is your go-to for straightforward conversational interactions. You provide a message, and it responds with a single, coherent reply. It's perfect for building simple chatbots that can answer frequently asked questions, generate creative content, or provide personalized product recommendations. Think of it as the reliable brainstorming partner who's always ready with a quick idea.

  2. Assistant API - Offers a more advanced set of features. This one remembers the context of your conversations, allowing for richer, more nuanced interactions. It also has built-in tools for tasks like searching the web, analyzing files, and even writing code. Need a virtual assistant to help you research a topic, analyze data for a marketing campaign, or even generate code snippets for a new website? The Assistant API is your creative ally. Imagine it as your resourceful and intelligent project manager, keeping everything organized and on track.

  3. Responses API - OpenAI is positioning this as a versatile option that combines the simplicity of Chat Completions with some of the advanced capabilities of the Assistant API. It's designed to be easier to use for many tasks while still offering the power to leverage tools and potentially even run code. This opens up a wide range of possibilities, from automating accounting tasks and streamlining customer support to integrating real-time data into financial analysis and enabling AI-powered agents to interact with GUIs to automate tasks. Think of it as the Swiss Army knife of OpenAI's API family – adaptable, resourceful, and ready to tackle a variety of creative challenges.

Why should we care (considering most of us are not developers)? Even if you're not writing code yourself, OpenAI APIs can significantly impact your digital experiences:

  • Improved User Experiences: Expect more intuitive, responsive chatbots, personalized recommendations, and seamless support for multiple languages in the apps and services you use daily.

  • Automation and Efficiency: Businesses are using these APIs to automate tasks like customer support, content creation, and data analysis. This means faster, more consistent service for you.

  • Innovation and Creativity: OpenAI APIs empower developers to create innovative features and functions, leading to more engaging, efficient, and even delightful applications.

  • Accessibility and Personalization: Look forward to more personalized experiences based on your preferences and behavior, from better recommendations to more relevant content.

  • Enhanced Security and Reliability: By automating testing and debugging, OpenAI APIs can help improve the security and reliability of the apps you rely on.

As we look to the future, expect APIs to become even more powerful and seamlessly integrated into our lives. We'll likely see more APIs incorporating AI directly, leading to more intelligent and automated interactions between applications. The rise of low-code and no-code platforms will make it easier than ever for non-developers to connect and use APIs, further democratizing access to these powerful tools. We can also anticipate a surge in specialized APIs tailored to specific industries and use cases, offering even more granular control and customization. And, of course, new APIs will emerge to connect the physical and digital worlds, enabling innovative experiences in the metaverse and Web3. The more you understand APIs, the better equipped you'll be to navigate and shape the future of technology.

MCPs: The Universal Translator for AI? Or Just Another API in the Crowd?

You've probably been watching the AI landscape evolve faster than a chameleon in a disco. Remember when LLMs were just glorified autocomplete? Now they're running third-party services, seemingly building Rome in a day. This week, something called the Model Context Protocol (MCP) exploded onto the scene, and everyone from developers to creatives seems to be riding the wave. Let’s explore what all the commotion is about, and discuss why you should even care.

As discussed above about OpenAI's APIs, APIs are the backbone of modern AI, allowing different software systems to communicate and exchange data. But MCPs, or Model Context Protocols, are aiming to take that concept a step further. Instead of just being a connection point, think of MCPs as a standardized set of rules for those APIs – a universal translator for AI agents and other software systems. It's like establishing a common language so that any AI can easily "talk" to any service, without needing a PhD in each individual API's nuances. Anthropic introduced this open standard on November 25, 2024, and the goal is that the community contributes to MCP.

Integrating AI with different data sources is often a coding nightmare, requiring bespoke integrations for every tool you wanted to use — MCP aims to fix that. The idea is that, instead of building custom connections, MCP provides a standardized way for APIs to describe how AI can interact with their systems. An AI agent adhering to the MCP standard should be able to understand, "Okay, this API lets me search for images," or "This API allows me to create a blog post," without needing a human to hand-hold it through the process.

MCPs promise to potentially to streamline your workflow and unlock new creative possibilities. We’ve seen examples of MCPs using Claude to i) talk directly to the 3D modeling software Blender - generate a 3D image from a prompt instead of having to learn the complicated software; ii) clone websites with a prompt using Firecrawl; iii) integrate academic databases; iv) access and utilize any data source or service without needing custom code; v) manage Google Drive and Slack. It’s a bit like the vibe coding trend, but instead of relying on carefully crafted prompts, you’re relying on standardized protocol to ensure seamless integration. (And the best part, the third party provider is responsible for maintaining it.)

MCPs could be a game-changer, simplifying AI integration and unlocking a new wave of creative applications. Or, they could be another overhyped tech trend that fades into obscurity. Only time will tell. But given the potential benefits, it's definitely something worth keeping an eye on.

Intriguing Stories

Robots Reimagined: Google DeepMind’s Gemini Robotics

The robot revolution has been promised for decades, but practical applications often feel like a distant dream. We’re still waiting on those jetpacks and self-folding laundry machines, but Google DeepMind has unveiled something that might actually change the game: Gemini Robotics.

Forget the clunky, pre-programmed automatons of yesteryear. These robots have brains – or, more accurately, the robots have access to Google's powerful Gemini 2.0 AI. Imagine a robot that doesn't just blindly follow instructions, but understands them. A robot that can adapt to a messy, unpredictable real world. DeepMind isn't just throwing AI at robots and hoping for the best; they've created two models working in tandem. Gemini Robotics is a "vision-language-action" (VLA) model – it sees, understands natural language, and translates that into action. Then, Gemini Robotics-ER provides advanced spatial understanding, figuring out where objects are, how they can be grasped, and how to plan a safe path. Gemini Robotics handles the "what" and "how," while Gemini Robotics-ER focuses on the "where" and "why." Together, they're making robots a lot smarter.

DeepMind is already working with robotics companies to test real-world applications. Imagine robots assisting surgeons, streamlining logistics, helping around the house, or performing precision manufacturing. DeepMind even showed off a robot that could perform a slam dunk with a toy basketball – despite never having seen anything basketball-related before. That's the power of Gemini's "world understanding" at work.

Of course, with great power comes great responsibility (thanks, Uncle Ben). DeepMind is taking robot safety seriously, incorporating Google AI Principles, using semantic safety checks, preventing bias, and experimenting with a "robot constitution" to steer the model away from unsafe or unethical tasks.

Gemini Robotics represents a major leap forward in AI-powered automation. It's not just about building robots that can do things, but building robots that can understand, adapt, and collaborate.

Russo Brothers Reboot Hollywood (Again)

Anthony and Joe Russo, the dynamic duo behind Marvel’s “Avengers: Endgame,” are once again poised to revolutionize Hollywood. This time, they’re merging artificial intelligence with traditional filmmaking to create a new paradigm in storytelling. Their latest venture, a high-tech studio, aims to empower artists by developing AI tools that facilitate “transmedia” storytelling—crafting narratives that seamlessly traverse films, games, and other media platforms.  

Transmedia storytelling isn’t just a buzzword; it’s the future of immersive entertainment. By creating reusable digital assets, the Russos envision a world where characters and storylines fluidly move across various media, offering audiences a richer, more interconnected experience. Imagine a character introduced in a film who later appears in a video game, with their storyline expanding in a graphic novel—all maintaining narrative coherence. AI plays a pivotal role here, assisting in the generation and organization of vast amounts of narrative information, ensuring consistency and continuity throughout the storyworld.  

The Russos’ commitment to integrating AI is evident in their strategic hires. They’ve brought on Dominic Hughes, a former Apple AI expert, as the Chief Scientific Officer of their production company, AGBO. This move underscores their dedication to developing in-house AI technologies tailored to enhance the creative process, ensuring that artists remain at the helm of innovation.  However, this technological leap isn’t without its challenges. The ethical implications of AI in creative decision-making are a hot topic. Concerns range from the potential displacement of human creativity to issues surrounding intellectual property rights. The Russos acknowledge these challenges but remain optimistic, emphasizing that AI should serve as a tool to enhance artistry, not replace it, and that “Artists need to lead the innovation, so that the technology is created to sere the artists, and not vice versa.”

In essence, the Russo brothers are not just directing films; they’re directing the future of storytelling. By embracing AI and transmedia narratives, they’re crafting a cinematic universe where technology and creativity coalesce, offering audiences experiences that are as interconnected as the digital world we inhabit.

— Lauren Eve Cantor

thanks for reading!

if someone sent this to you or you haven’t done so yet, please sign up so you never miss an issue.

if you’d like to chat further about opportunities or interest in AI, please feel free to reply.

if you have any feedback or want to engage with any of the topics discussed in Verses Over Variables, please feel free to reply to this email.

banner images created with Midjourney.