Whizzy Ideas

New launch overwhelm; Can AI help and critique; Real-time 3D world generation; Economics of AI pricing

I don't normally post the big, obvious news stories here, as there are plenty of sources for those. But... what a week! Look at the sheer number of significant new launches:

Those are just the bigger things. There are many more. So much for having some downtime in the summer!

A Treatise on AI Chatbots Undermining the Enlightenment

This starts as a useful critique of a New York Times opinion piece that follows a common and unhelpful pattern: the author quotes specific interactions they've personally had with a specific AI system and extrapolates widely. Maggie Appleton shows that an AI system can readily take on different personas. The more interesting second part examines the problem of a universal chat interface. How can it take on all the different roles we need it to (or should need it to)?

How might we accommodate both needs: the generous, informative, helpful assistant and the critical teacher and interlocutor?

She raises important questions. Is it the responsibility of the foundation labs to help you become a better thinker, rather than attempt the thinking for you? Will the more agreeable, borderline sycophantic personas win out in the marketplace, or is there a place for a tool that challenges? I also believe the vast majority of users won't be fine-tuning prompts, let alone crafting different personas, so in the end whoever controls the default interface will, like Google's first page of search results, have undue influence.

Genie 3: A new frontier for world models

This feels like a big deal that I don't fully understand yet. Google DeepMind are continuing to work on models that effectively simulate 3D worlds that you can navigate around (with no underlying 3D model or game engine). These systems seemed like quirky demos last year, without clear applications. You can't try Genie 3 out for yourself yet, but the demos are remarkable: real-time rendering of the next frames, with an apparent ability to "remember" the environment. In early versions of this kind of technology, you'd see an object, look in the other direction, look back, and it would most likely be gone or replaced by something entirely different. When a world is generated frame by frame, it is hard for an AI system to keep any continuity. Genie 3 seems to have it solved, for minutes at a time. I am still unclear on the applications. GDM discuss using these generated environments to train AI agents, and that makes sense. But surely there's more.

Veo 3 Just Lost Its Crown to this AI Video Tool

Another recommendation: AI Film News from Curious Refuge is a really detailed roundup with demonstrations of new features and products. This week they discussed Genie 3 but also spent time on the Seedance video generator from ByteDance. They claim it produces better results than Veo 3 (to me they look pretty close, but it does score higher on benchmarks).

tokens are getting more expensive

A forthright, opinionated treatise on how people only want the latest, best models, and the latest, best models need more tokens. People prefer a flat rate monthly price, and may not tolerate per-token fees, but that isn't sustainable. If a deep research query costs the AI company $1 but they're charging $20 a month, it doesn't stack up. Worth reading.
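
To make that arithmetic concrete, here is a rough back-of-envelope sketch. The $1 per query and $20 a month figures are the ones quoted above; the function names and the usage levels in the loop are my own illustrative assumptions, not numbers from the article.

```python
# Back-of-envelope flat-rate economics.
# $20/month and $1/query come from the example above; everything else
# (function names, the usage levels tried) is a hypothetical illustration.

def break_even_queries(monthly_price: float, cost_per_query: float) -> float:
    """Number of queries per month at which revenue equals serving cost."""
    return monthly_price / cost_per_query


def monthly_margin(monthly_price: float, cost_per_query: float, queries: int) -> float:
    """Provider's margin on one subscriber for a month (negative means a loss)."""
    return monthly_price - cost_per_query * queries


if __name__ == "__main__":
    price, cost = 20.0, 1.0
    print("break-even queries per month:", break_even_queries(price, cost))
    # Assumed usage levels, purely to show how quickly the margin flips.
    for queries in (5, 20, 50):
        print(queries, "queries ->", monthly_margin(price, cost, queries))
```

On those numbers a subscriber stops being profitable after 20 deep research queries in a month, which is less than one a day.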

In the Future All Food Will Be Cooked in a Microwave, and if You Can’t Deal With That Then You Need to Get Out of the Kitchen

Wonderful skewering of current AI debates :).

#ai-creative #ai-foresight #ai-philosophy