The Year of the Large Language Model
It has been one year since ChatGPT was released. Being in the AI space before then now qualifies as having been “early.” And things still seem to be speeding up.
Although multi-modal capabilities have existed in some form for a while (CLIP embeddings, for example, offer a shared latent space for text and images), crossing between modalities seamlessly with a single model has been challenging. No more.
The ChatGPT-4V lineage of models successfully interprets and produces images, which has almost immediately led to some very interesting use cases. Watch the video below from Twitter user @tazsingh to get a sense of what I mean.
How to draw an owl with @tldraw
The first link on deck is about the platform used to make the demo, tldraw.
What’s more, good video generation seems to be right around the corner. Stability AI released SDVideo this month, their model and weights for generating short video clips of 1-2 seconds. The video at the top of the page, an oceanside mountain framed by clouds, was first generated in DALL-E 3 and then animated with SDVideo. Of the samples I tried, this was the best one. It’s not quite there yet, but it’s getting close to being quite good.
Skeptics may argue that LLMs are neither new nor special, and that the hype around them is just that. But the bottom line is that computers are gaining features and abilities that were well out of reach even four years ago. There is something happening here.
AI
- make real, the story so far - by Steve Ruiz - tldraw · Really impressed by what they’ve built here. It’s simple and refreshing, and now getting very powerful AI tools.
- Stable Diffusion XL · Image generation is quickly becoming a commodity feature, largely due to Stable Diffusion continuing to release open
- An Interactive Guide to CSS Grid
- LM Studio - Discover, download, and run local LLMs
- IGF - a Hugging Face Space by dylanebert
- PromptIDE
Front End
- Vite - Vite has been a refreshing “just does the thing” type of tool. It’s fast, versatile, and now seems to be the de facto starting point for new web projects. Remix is considering a move to a Vite plugin. Astro and SvelteKit are already there.
- Blinded By the Light DOM
- Building towards a new default rendering model for web applications
- Transition Behavior - very cool new property that I’ve added to my CSS Wiki page
Misc
- Notes toward a pitch
- DAK and the Golden Age of Gadget Catalogs
- Nintendo Company Brochure (2015) – cabel.com
- Wink Optics San Anselmo (formerly Sunglass City)
- Illuminate: An Evening with Rupy C. Tut - ICA SF
- Social heuristics shape intuitive cooperation
- The Eleven Laws of Showrunning
Tech
- Folk Interfaces
- A hypothetical path to the Speakularity / Snarkmarket
- A $7 Subscription
- 2,851 Miles // Bill Gurley (Transcript + Slides)
- Habits of great software engineers
- The Bluffer’s Guide to The Mythical Man-Month
- Threads: The inside story of Meta’s newest social app
- The beauty of finished software | Jose M.
- Confusing git terminology
And one very nice personal website: