Token Talk 21: We Built the Chips. Now Build the Apps

June 11, 2025

By: Thomas Stahura

Last December, Google unveiled Willow, its new quantum chip. The media dubbed it mind-boggling when, with only 105 qubits, it was able to solve a problem in five minutes that would take a classical computer ten septillion years to complete. That problem is called Random Circuit Sampling (or RCS), which I’ll explain in a bit.

On the news, Google’s stock jumped, as did shares of its competitors: Microsoft, Rigetti, D-Wave, and IonQ. For a moment, it seemed, quantum hype dethroned AI to become the talk of the town. Two months later, Microsoft responded by announcing its own quantum chip called Majorana 1, causing another stock bump. However, at only 8 qubits, it's still early stage, and the tech giant has yet to publish its RCS results.

RCS is a benchmark rather than a practical problem; it's designed to gauge quantum computer performance. The whole test is basically: "Can you sample from this crazy quantum distribution faster than classical computers can even calculate what that distribution should be?"

To do this, researchers must:

  • Pick a number of qubits (like 105 for Google's chip)

  • Generate a random circuit of 20+ layers of quantum gates (like the Hadamard or Pauli-X gates mentioned last week)

  • Run the circuit, all 20+ gate layers, a million times or so

  • Collect each run's bitstring output (something like "01101001...")

  • Use classical computers to simulate what the "perfect" quantum computer would output

  • Measure how close your actual results are to the ideal

Each additional gate creates more quantum entanglement between qubits. More layers = more complex quantum correlations = harder for classical computers to track. More than 20 layers is where classical simulation becomes practically impossible. If a quantum computer finishes in minutes but classical takes years → quantum advantage. At least, that's how the thinking goes.
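
To make that recipe concrete, here’s a minimal sketch of the sampling loop in Python, using a plain NumPy state-vector simulator. This is a toy of my own, not Google’s benchmark code: the qubit count, layer count, and gate set (just the Hadamard, Pauli-X, and controlled-Z gates) are illustrative stand-ins, and a circuit this small is trivial for a laptop to simulate exactly.

    import numpy as np

    rng = np.random.default_rng(0)
    n_qubits = 4      # tiny; Willow used 105
    n_layers = 8      # real benchmarks use 20+ layers
    n_samples = 1_000

    H = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)  # Hadamard
    X = np.array([[0, 1], [1, 0]], dtype=complex)                # Pauli-X

    def apply_single(state, gate, target):
        # Apply a one-qubit gate to `target` by building the full 2^n operator.
        op = np.eye(1, dtype=complex)
        for q in range(n_qubits):
            op = np.kron(op, gate if q == target else np.eye(2))
        return op @ state

    def apply_cz(state, q1, q2):
        # Controlled-Z: flip the sign of amplitudes where both qubits are 1.
        out = state.copy()
        for idx in range(len(out)):
            if (idx >> (n_qubits - 1 - q1)) & 1 and (idx >> (n_qubits - 1 - q2)) & 1:
                out[idx] *= -1
        return out

    # Build a random circuit: each layer applies random one-qubit gates, then
    # entangling CZ gates between neighboring qubits.
    state = np.zeros(2 ** n_qubits, dtype=complex)
    state[0] = 1.0                                   # start in |0000>
    for _ in range(n_layers):
        for q in range(n_qubits):
            state = apply_single(state, H if rng.random() < 0.5 else X, q)
        for q in range(n_qubits - 1):
            state = apply_cz(state, q, q + 1)

    # "Run" the circuit many times by sampling bitstrings from the ideal
    # output distribution |amplitude|^2, then compare against that ideal.
    probs = np.abs(state) ** 2
    samples = rng.choice(2 ** n_qubits, size=n_samples, p=probs)
    print([format(s, f"0{n_qubits}b") for s in samples[:5]])     # e.g. ['0110', ...]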

RCS is cool but not practical. It's like saying your AI passed a Mensa IQ test. So where are the real-world quantum applications?

Enter the wonderful world of optimization and quantum annealing!

Classically, annealing is an algorithm inspired by metallurgy: you heat up a material and then cool it slowly so atoms settle into a low-energy (optimal) state. In the math world of optimization, you randomly explore solutions, occasionally accepting worse ones to escape local minima, and gradually “cool” to settle into the best solution.

Imagine you’re standing in a vast, foggy landscape of rolling hills and valleys. Each point in this landscape represents a possible solution to your optimization problem. The height at each point is the “energy” of that solution — the lower the energy, the better. Classical annealing is like wandering this landscape with a lantern. At first, you’re allowed to take big, random steps, even uphill, so you don’t get stuck in a small valley (local minimum). As time goes on, you “cool down,” and your steps get smaller and more cautious, focusing on moving downhill. The hope is that, by the end, you’ve found the deepest valley, the global minimum. The catch? Sometimes, no matter how clever you are at wandering, you can still get stuck in a valley that isn’t the lowest one (not optimal). The fog is thick, and you can’t see the whole landscape at once.
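
Here’s what the classical version looks like in code: a minimal simulated-annealing sketch in Python. The energy function, step size, and cooling schedule are arbitrary choices I made for illustration, not anything from a production solver.

    import math
    import random

    random.seed(42)

    def energy(x):
        # A bumpy 1-D "landscape" with several local minima (an arbitrary toy function).
        return 0.5 * x ** 2 + math.sin(8 * x)

    x = random.uniform(-3, 3)   # start somewhere random in the fog
    temperature = 2.0           # high temperature = big, adventurous steps
    cooling_rate = 0.995

    for _ in range(5_000):
        candidate = x + random.gauss(0, 0.3)   # take a random step
        delta = energy(candidate) - energy(x)
        # Always accept downhill moves; accept uphill ones with probability
        # exp(-delta / T), which shrinks as the system "cools".
        if delta < 0 or random.random() < math.exp(-delta / temperature):
            x = candidate
        temperature *= cooling_rate

    print(f"settled near x = {x:.3f} with energy {energy(x):.3f}")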

Quantum annealing replaces random steps with quantum tunneling, allowing the system to “tunnel” through energy barriers rather than climb over them. In our example, instead of just walking over the hills, you can tunnel through them to a lower valley on the other side, even if it looks impossible from a classical perspective. Essentially, thanks to quantum mechanics, quantum tunneling can help escape local minima that would trap a classical algorithm.

Without getting too technical, quantum annealing does not use any quantum logic gates! Instead, an optimization problem is encoded as a Hamiltonian (fancy math representing the system's total energy). This sets up the energy landscape so that the lowest energy state (the ground state) represents the best solution to the problem. Then, thanks to quantum physics, the system naturally wants to stay in its lowest energy state and "relaxes" into the answer.
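
To make "encode the problem as an energy landscape" concrete, here’s a toy sketch: a tiny number-partitioning problem written as an Ising-style energy function, with brute force standing in for the annealer. On real hardware the chip physically relaxes into this same ground state; the numbers and encoding here are just illustrative, not D-Wave’s tooling.

    from itertools import product

    # Split {3, 1, 4, 2} into two groups with equal sums. Each number gets a
    # spin s in {-1, +1} (which group it belongs to); the energy is the
    # squared imbalance between the two groups.
    numbers = [3, 1, 4, 2]

    def energy(spins):
        return sum(n * s for n, s in zip(numbers, spins)) ** 2

    # Brute force stands in for the annealer: the lowest-energy spin
    # assignment (the "ground state") is the best partition.
    best = min(product((-1, +1), repeat=len(numbers)), key=energy)
    group_a = [n for n, s in zip(numbers, best) if s == +1]
    group_b = [n for n, s in zip(numbers, best) if s == -1]
    print(group_a, group_b, "energy:", energy(best))   # two groups summing to 5 each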

Companies like D-Wave, founded in 1999, are leading the charge in quantum annealing. D-Wave’s Advantage system, accessible via its Leap cloud platform, has been used by the likes of Volkswagen to optimize traffic flow and by SavantX to streamline port operations, reducing costs and improving efficiency. D-Wave charges a subscription for cloud access and consulting services. In 2024, D-Wave reported contracts with major firms, contributing to its growing commercial traction.

Similarly, IonQ, which runs a quantum computing manufacturing facility up in Bothell, operates primarily on a Quantum-as-a-Service model, providing access to its quantum computers via major cloud platforms like AWS, Azure, and GCP. The company was founded in 2015 and became the first quantum company to IPO back in 2021.

Beyond optimization and the cloud, quantum computing is making inroads in drug discovery and materials science. For example, Algorithmiq’s collaboration with IBM’s Quantum Network focuses on quantum chemistry simulations to identify promising drug candidates, potentially shaving years off development timelines. The company generates revenue through partnerships and by licensing its software platform, and it secured €13.7 million in funding to scale its offerings.

Quantinuum is also working with firms like Samsung to apply quantum algorithms in materials design, optimizing material properties for semiconductors and batteries. These early applications, still in the prototyping phase, are driving real revenue through research contracts and pilot projects.

Quantum applications are hitting the market and making money. We now have enough qubits to do cool things! It feels like the bottleneck is shifting from hardware to software. The industry needs more quantum developers to build the next generation of algorithms and apps. Or maybe we'll develop an AI that can program in Q# or other quantum languages. On the hardware side, things are starting to get crowded: Xanadu, Alice & Bob, Atom Computing, PsiQuantum, Rigetti, NVIDIA, QuEra Computing, and Intel, just to name a few, are all developing their own quantum computers.

I think we'll see much more change in the quantum industry in the next 30 years than in the last 30, with most of that change coming from innovative software.

Stay tuned next week for the final installment of our quantum series!

P.S. If you have any questions or just want to talk about AI, email me! thomas @ ascend dot vc

Tags Token Talk, Quantum Computing

Token Talk 20: How Quantum Computers Work, Pt. 1

June 4, 2025

By: Thomas Stahura

Editor’s note: This is the first in a three-part Token Talk series on quantum computing. Today’s post covers the fundamentals of how quantum machines work. Next week, we’ll dive into the key players in the field and the startups already building real applications.

You’ve probably heard of quantum computers. 

Invented in 1998, this breed of thinking machine is billed as the quintessential classical-computer disrupter. But when asked exactly why or how these machines will change the world, most folks just shrug.

Over the last 27 years, the field has gone from two qubits per chip to an astounding 1,121 qubits in IBM's latest quantum chip. Still, few have seen, let alone used, a quantum computer. What gives?

Before diving into the new world of quantum computers, let's quickly cover the old world of classical computers.

Classical computers (like the device you're looking at now) store information in binary bits: 1s and 0s. This information flows through a series of logic gates that each perform a certain mathematical operation. The logic gates are: NOT, AND, OR, NAND (Not AND), NOR (Not OR), XOR (Exclusive OR), and XNOR (Exclusive NOR / Equivalence).

Take the NAND gate. Its function is to output 0 only if both of its inputs are 1; otherwise, it outputs 1.

So,

Input: 1, 1 → Output: 0

Input: 1, 0 → Output: 1

Input: 0, 1 → Output: 1

Input: 0, 0 → Output: 1

The NOR gate, on the other hand, outputs 1 only if both inputs are 0; otherwise, it outputs 0.

So,

Input: 0, 0 → Output: 1

Input: 0, 1 → Output: 0

Input: 1, 0 → Output: 0

Input: 1, 1 → Output: 0

And lastly, the NOT gate (AKA the inverter) flips the input.

So,

Input: 1 → Output: 0

Input: 0 → Output: 1
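
As a quick aside, these gates compose. Here’s a tiny sketch of a half-adder (a circuit that adds two bits) built entirely out of NAND gates, which alone can reproduce every other gate. The function names are mine; it’s a toy illustration.

    def nand(a, b):
        # NAND: outputs 0 only when both inputs are 1.
        return 0 if (a and b) else 1

    def xor(a, b):
        # XOR built from four NAND gates.
        n = nand(a, b)
        return nand(nand(a, n), nand(b, n))

    def half_adder(a, b):
        # Add two bits: returns (sum bit, carry bit).
        total = xor(a, b)
        carry = nand(nand(a, b), nand(a, b))   # AND built from two NANDs
        return total, carry

    for a in (0, 1):
        for b in (0, 1):
            print(a, b, "->", half_adder(a, b))   # 1 1 -> (0, 1), i.e. binary 10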

Logic gates are the LEGO bricks of computation. By chaining them together, you build circuits that can add, subtract, multiply, and more. Ok, now to understand how quantum computers differ from classical computers, you also need to understand the concept of reversibility.

A logic gate is reversible if you can always uniquely recover the input from the output.

For example, suppose a NAND gate outputs a 1. What was the input? It could be 0,0 or 0,1 or 1,0. Since we cannot uniquely recover the input from the output, we say NAND gates are not reversible. In other words, information (about the input) is lost.

NOT gates, on the other hand, are reversible. For example, if a NOT gate outputs a 0, we know the input must be 1. And if it outputs a 1, its input must be 0.

Now that you get classical gates — NAND, NOR, NOT, etc. — it's time to dive into quantum computers because they are playing a whole different game. Instead of bits, they use qubits. 

Qubits aren’t just 0 or 1; they can be both at the same time (that’s superposition). And quantum gates are the logic gates that manipulate these qubits.

The first rule of quantum math: every quantum gate is reversible, meaning you can always run it backward and recover your original state.

Classical gates (like NAND/NOR) can destroy info (not reversible). Quantum gates never do. They’re always reversible, always unitary (fancy math words for “no info lost”).

Because of this reversibility requirement, quantum computers use a unique set of quantum logic gates that permit a certain kind of math. Let's go over two of them:

The Hadamard (H) gate is the superposition gate. Input a 0, you get a 50/50 mix of 0 and 1. Imagine flipping a coin: as it spins in mid-air, it traces out a 3D sphere, and its probability, at that moment, is a 50/50 chance of being heads or tails. Input a 1, same deal — still a 50/50 mix, but with a phase flip. Imagine representing the direction and speed of the coin’s spin as an arrow in 3D space: this arrow has a direction (phase) and a speed (magnitude). Flipping the phase reverses the direction of the coin's spin. The Hadamard gate is how you unlock quantum parallelism: it takes a boring, definite state and turns it into a quantum probabilistic state. In short, it’s the logic gate that turns classical bits into quantum bits.

So,

Input: |0⟩ → Output: 50% chance of being 1 or 0

Input: |1⟩ → Output: 50% chance of being 1 or 0

Once your qubit is in superposition, you can start doing some wild quantum tricks. The next essential gate is the Pauli-X gate (often just called the X gate). Think of the X gate as the quantum version of the classical NOT gate. It flips the state of a qubit:

Input: |0⟩ → Output: |1⟩

Input: |1⟩ → Output: |0⟩

If your qubit is in superposition (say, α|0⟩ + β|1⟩), the X gate swaps the amplitudes:

Input: α|0⟩ + β|1⟩ → Output: α|1⟩ + β|0⟩

Still reversible, still no info lost.

In quantum computing, amplitudes (like α and β) are complex numbers that represent the arrows in 3D space mentioned earlier. They encode both the phase and magnitude of a qubit's state, and the probability of measuring a given outcome is the squared magnitude of its amplitude. The phase (angle) of the amplitude affects how quantum states interfere, but it is not directly observable as a probability.
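
Since these gates are just small matrices acting on amplitude vectors, you can sanity-check everything above with a few lines of NumPy. A minimal sketch (not a real quantum runtime, just the linear algebra):

    import numpy as np

    ket0 = np.array([1, 0], dtype=complex)                        # |0>
    ket1 = np.array([0, 1], dtype=complex)                        # |1>
    H = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)   # Hadamard
    X = np.array([[0, 1], [1, 0]], dtype=complex)                 # Pauli-X

    plus = H @ ket0            # (|0> + |1>) / sqrt(2): an even superposition
    minus = H @ ket1           # (|0> - |1>) / sqrt(2): same 50/50 mix, phase flipped
    print(np.abs(plus) ** 2)   # [0.5 0.5] -> measuring is a coin flip
    print(np.abs(minus) ** 2)  # [0.5 0.5] -> the phase is invisible to measurement

    state = 0.6 * ket0 + 0.8 * ket1   # alpha = 0.6, beta = 0.8 (0.36 + 0.64 = 1)
    print(X @ state)                  # amplitudes swapped: 0.8 on |0>, 0.6 on |1>

    print(np.allclose(H @ H, np.eye(2)))   # True: H undoes itself, i.e. reversible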

After many quantum logic gates, when you measure a qubit, its superposition collapses to a definite 0 or 1. So, to get a quantum speedup, your algorithm must:

  • Exploit superposition and entanglement to process many possibilities at once.

  • Be reversible (unitary operations only).

  • Use a technique called interference to amplify the correct probabilities and cancel out the wrong ones.

Most problems don’t fit this mold. If you just naively port classical code, you’ll get no speedup — or worse, a slowdown.

As of today, there are only four algorithms that take advantage of quantum computers’ unique properties. They are: Shor’s Algorithm (factoring integers), Grover’s Algorithm (unstructured search), Quantum Simulation (physics simulations), and Quantum Machine Learning (QML).

  • Shor’s algorithm, using the quantum Fourier transform, finds the prime factors of large numbers exponentially faster than the best classical algorithms. This has massive implications for cryptography, since it breaks RSA encryption, which relies on prime factoring being difficult and secures most of the internet today.

  • Grover’s algorithm, using amplitude amplification to boost the probability of the correct answer, searches an unsorted database of a million items with roughly 99.9% fewer queries (see the rough arithmetic after this list). And the speedup grows as the database gets bigger.

  • Quantum Simulation, using entanglement and superposition, models complex quantum systems — like molecules, proteins, or new materials — that are impossible for classical computers to handle. This unlocks breakthroughs in drug discovery, chemistry, and materials science by letting us “test” new compounds in silico before ever touching a lab.

  • Quantum Machine Learning (QML), using quantum circuits, can turbocharge core tasks like linear algebra and sampling. Quantum computers, in theory, can solve huge systems of equations, invert matrices, and sample from complex probability distributions faster than classical machines. Though this is still very much in the domain of researchers.
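
For the Grover claim above, the back-of-the-envelope arithmetic looks roughly like this (an illustrative estimate, not a benchmark):

    import math

    N = 1_000_000
    classical_queries = N / 2                           # average checks to find one item
    grover_queries = round(math.pi / 4 * math.sqrt(N))  # ~785 quantum iterations
    savings = 1 - grover_queries / classical_queries
    print(grover_queries, f"{savings:.1%}")             # 785, about 99.8% fewer queries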

A new wave of pre-quantum startups is building the application layer for quantum computing. Just as AI startups turned research into real-world value, these teams are doing the same for quantum by targeting proven algorithmic advantages. They are developing tools for drug discovery, molecular modeling, cybersecurity, faster search, and design optimization in aerospace and manufacturing. These companies are positioning themselves now so they are ready to scale when the hardware becomes readily available.

Ok, that was a crash course in quantum computing! Abstract, but just scratching the surface. And there’s still a whole universe left to explore: More quantum logic gates, quantum error correction (how do you keep qubits from falling apart?), decoherence (why do quantum states vanish so easily?), entanglement (spooky action at a distance, anyone?), and the wild world of quantum hardware (trapped ions, superconducting circuits, photonics, and more). We haven’t even touched on the real-world challenges — scaling up, keeping things cold, and making quantum computers actually useful outside the lab. 

Tags Token Talk, Quantum Computing, Quantum

Token Talk 19: The Hype Train that Keeps on Chugging

May 28, 2025

By: Thomas Stahura

Whenever I talk to someone who doesn’t follow AI news every day, the reaction is usually some variation of the same sentiment: Impressive but scary! That feels automatic now, like it’s been rehearsed. Each week’s headlines blur into the last. 

It makes AI feel like old news. People seem to be waiting for the really big announcement. But what would that even look like? And what does that say about where we are in the AI hype cycle?

The reason I bring this up is because last week, for me, really felt like one of those “holy shit!” type weeks — and it came from a flurry of announcements you may have seen but already forgot about. To catch you up:

  • Anthropic released its Claude 4 family of models

  • OpenAI acquired Jony Ive’s io design firm for $6.5 billion, catapulting OpenAI’s ambition into hardware

  • Microsoft debuted Windows computer use agents and open sourced Github Copilot at MS Build

  • Google held its annual IO developer conference, announcing Gemini updates, a new open-source Gemma model, Mariner browser agent in Chrome, and Veo 3 with audio generation (an impressive release given that it’s notoriously hard to sync generated video with audio)

So, here’s my take on the week’s announcements:

  • Claude 4 is incredible at coding, but average everywhere else. 

  • If the Sam Altman–Jony Ive collaboration isn’t some kind of BCI wearable, it’ll feel like a letdown. 

  • Microsoft made a lot of noise but showed few real products. 

  • Google stole the show. I/O was sharp, and Veo 3 outputs flooded X/Twitter feeds. 

The big announcements soaked up most of the attention, overshadowing some equally promising — but less polished — developments elsewhere in the AI world.

  • For starters, ByteDance quietly dropped a new open-source model: BAGEL, a 7-billion-parameter omni model capable of understanding and generating language (reasoning and non-reasoning) and images (generating, editing, and manipulating them). The model outperforms Qwen2.5-VL and InternVL-2.5. It's only missing audio to complete the omni-modality trifecta!

  • Alibaba updated its Wan2.1 video model. Claiming SOTA at 14 billion parameters, it can run on a single GPU and produce impressive 720p videos or edits. Still no audio for the videos. I’m noticing a trend…

  • Google, during IO, open sourced MedGemma, a variant of Gemma 3 fine-tuned on medical text and clinical image comprehension. The model is designed to answer your medical questions like a nurse and analyze your X-rays like a radiologist. It’s available for free in 4B and 27B sizes.

That was the news of the last few weeks. Plenty of flash, plenty worth watching.

But the hype cycle has a funny way of resetting itself. And I’ve been thinking more about what’s happening off to the side. The stuff that isn’t getting the spotlight, but might shape the next phase of this industry (and maybe future Token Talk topics). 

Stuff like DeepMind’s AlphaEvolve paper, which introduces a Gemini-powered agent designed specifically for the discovery and optimization of algorithms. AlphaEvolve uses an evolutionary framework to propose, test, and refine entirely new algorithmic solutions. It’s a tangible step towards AI systems that can do the science of computer science: actively exploring the digital codescape and uncovering novel solutions, demonstrating a form of discovery.

A nonprofit out of San Francisco called Future House is pursuing a much broader goal: automating the entire process of scientific discovery. It recently unveiled Robin, a multi-agent system that achieved its first AI-generated discovery: identifying an existing glaucoma drug as a potential new treatment for dry macular degeneration. Robin basically orchestrated a team of specialized AI agents to handle everything from literature review to data analysis, proving that AI can indeed drive the key intellectual steps of scientific research.

It’s easy to mistake noise for signal, hype for substance. And believe me, there is more noise than signal in the AI world right now. But that happens at some point in every tech cycle. I think it would be a huge mistake to completely dismiss today's AI ambitions of automated discovery or human-machine telepathy. 

AI today feels like where 3D printing was in 2013. Still a lot of excitement but noticeably less than a few years ago. Will there be another AI winter? Almost certainly. Will it be anytime soon? No.

Hype doesn’t die as much as it transitions from one idea to another, from one industry to another. Within AI, chatbots, agents, and now discovery and robots have all been hyped. In the broader tech industry, mobile was hyped, then cloud, crypto, and now AI. 

What's next? What new tech breakthrough will catch the collective consciousness the way AI has? Maybe space, carbon nanotubes, CRISPR, room temperature superconductors, fusion, quantum, or something entirely new that comes out of left field… Time will tell, so stay tuned!

Tags Token Talk, AI Hype Cycle

Token Talk 18: What’s Microsoft's Forking Problem?

May 21, 2025

By: Thomas Stahura

When Microsoft released Visual Studio Code in 2015, it quietly marked the start of a new era in software development. A decade later, the free and open-source code editor became the dominant platform for programmers, used by nearly three-quarters of developers worldwide. 

It didn't take long for VS Code to dominate the code-editor market. The product helped fuel Microsoft’s broader push into cloud services and artificial intelligence, tying together Azure, GitHub and, later, OpenAI. But as generative AI reshapes software development, startups built on top of VS Code are now turning into competitors.

In 2015, I was building Minecraft mods in Eclipse. A year later, my AP computer science class and robotics team (shoutout Team 1294!) switched to VS Code. I stuck with it for the next eight years, along with most of the developer world. Today, 73% of programmers use VS Code. I did too, until last year.

So if VS Code is free and open source, how does it make money? 

IDEs are big business, especially for a software giant like Microsoft. Sure, Microsoft doesn’t make money from the IDE itself, but the developers who use it fuel spending on cloud services like Azure, generating tens of billions of dollars for the company. When bundled with GitHub, which Microsoft acquired for $7.8 billion in 2018, and integrated into VS Code, the world's most popular IDE, it's easy to see how Azure and the cloud became Microsoft’s main moneymaker today.

Former CEO Steve Ballmer was correct when he thundered the famous “Developers! Developers!! Developers!!!” line at Microsoft’s developer conference in 2005.

Twenty years later, Satya Nadella said Microsoft evolved into “a platform company focused on empowering everyone with AI.” That evolution began in 2019 when Microsoft made its first billion-dollar investment in OpenAI. Early models like GPT-2 showed potential with generating code. And GPT-3 proved to be an expert at writing boilerplate code. In 2021, months before OpenAI’s ChatGPT debut, Microsoft launched Github Copilot and bundled it with VS Code.

At $20 per month, it isn't cheap, but it was given away to students for free. It was an early product and an obvious game-changer for programming. The consensus at the time was that Microsoft, owning Azure, Github, VS Code, and 50% of OpenAI, would dominate the emerging AI IDE industry.

In hindsight, that couldn’t be further from the truth. The entire tech landscape saw the value of generative coding. Millions of developers started using it every day. Companies now brag about the percentage of their code that is AI generated. And AI coding was rebranded as vibe coding.

Developers began forking VS Code en masse (approximately 32,000 times) to build their own separate IDEs. Companies like Cursor and Windsurf reached billion-dollar valuations in the past two years, and countless others like Pear AI have raised millions and gotten into YC — all off the back of Microsoft and VS Code.

The culmination of this forking frenzy came with OpenAI’s acquisition of Windsurf earlier this month. Think about it: Microsoft owns VS Code and half of OpenAI. Windsurf forks VS Code and is acquired by OpenAI. Microsoft now technically owns half of Windsurf, a competitor built on top of its own product. This feels like the final nail in the coffin for the Microsoft-OpenAI partnership.

Yesterday, in response to the acquisition, Satya announced Microsoft is open-sourcing GitHub Copilot, probably in an attempt to undercut the viability of the many VS Code fork startups.

How that will play out remains to be seen. However one thing is for sure: AI coding is the current killer use case for generative AI. The model makers are racing to saturate the coding benchmarks.

P.S. If you have any questions or just want to talk about AI, email me! thomas@ascend.vc

Tags Token Talk, Fork VSCode

Token Talk 16: When Proving You’re Human Gets You Paid

May 6, 2025

By: Thomas Stahura

Sam Altman often says he knows within 10 minutes if he wants to work with someone. After such a meeting with Alex Blania, he was convinced of Alex’s exceptional abilities. Their initial chat quickly turned into a multi-hour walk where they discussed their ambitions, the future, and ultimately, the World project.

World.org is the online home of the World Foundation, a Cayman Islands company that vaguely aims to “create more inclusive and fair digital governance and economic systems,” aligned with several UN Sustainable Development Goals. The foundation operates under the umbrella of Tools for Humanity, a parent company chaired by Sam Altman.

Ok fine, another Altman for-profit-not-for-profit project with a complicated corporate structure full of platitudes. But what's the product here? How does it make money? That's where things get a little Black Mirror.

The company aims to authenticate real humans in the age of AI. By scanning your face using one of its orbs, your unique biometric data is added to the so-called “World Chain” (its Ethereum secured blockchain) and you are issued a World ID and free World Coin for verifying your humanity.

Once verified, you can join World App, a human only super app with its own app store, encrypted messaging platform, and crypto wallet.

As for the coin, 10% of its supply is already allocated to employees and another 10% to investors (most notably Andreessen Horowitz). Worldcoin owners can send tokens to each other, vote on World Foundation proposals, or sell. So far this year the coin’s value has dropped 86%.

I mentioned last week the online human authentication problem is indeed a very real problem. However, I think this Youtube comment sums up World’s reception online:

“Can I scan and record your fingerprints [Sic] don't you worry what for, here's 10 bucks.”

So, if you’re wary of swapping your biometrics for crypto, you’re not alone. Good thing Worldcoin isn’t the only sheriff in town when it comes to proving you’re human. The digital frontier is already patrolled by the likes of Google’s reCAPTCHA (the service that has us clicking all the traffic lights), Cloudflare’s bot-fighting checkmark boxes, and Jumio’s ID verification scanner. Each offers a different flavor of the same promise: keep the bots at bay, let the real people in. But as AI gets smarter, so do the bots, and the arms race for digital authenticity will likely never end.

For startups, this means the old playbook — collect data quietly, hope no one notices — doesn’t cut it anymore. Today, you’re building a product and cultivating trust. That means being upfront about how you’re protecting data and keeping out the fakes, whether you’re using open-source code, third-party products, or just informative English explanations. If you can’t show your users how you’re protecting their privacy and their identity, someone else will — and they’ll win the trust war and those customers.

But beyond the technical and privacy concerns, Worldcoin’s pitch is more than just about proving you’re human — it’s about what you get for it. It's the idea of rewarding people for their mere existence, rather than their labor. Or in other words, a form of Universal Basic Income (UBI).

Andrew Yang mainstreamed the term during his 2020 presidential run. He proposed a “freedom dividend” of $1,000 per month for each adult American citizen. A very popular idea for obvious reasons.

That same year, Altman conducted a study giving 3,000 individuals $1,000 per month over a 3 year period, the largest study of its kind. The results concluded UBI provided immediate financial relief and increased personal freedom, but did not lead to lasting financial security or major changes in employment quality.

Altman has since proposed a new idea: Universal Basic Compute, essentially giving everyone access to a share of AI computing power instead of regular cash. People could use, sell, or donate their allotted compute. Meanwhile, Elon Musk envisions a future of Universal High Income, brought about by AI-automated abundance. How these projects will be paid for remains to be seen.

It seems the real story here is about the age-old tension between privacy and progress. We want the benefits of AI, UBI, and digital identity, but we’re not quite ready to trade our faces for a few tokens and a promise. The question isn’t “can we build it?” since we know we can, it's “should we scan it?” 

Altman knew in 10 minutes that Alex Blania was worth betting on. The rest of us get an orb, a coin, and a promise. For Worldcoin to work, that has to be enough.

Tags Token Talk, WorldCoin

Token Talk 15: Was the internet ever alive?

April 30, 2025

By: Thomas Stahura

LinkedIn banned me. I was running a scraper to enrich a dataset for Ascend and triggered its aggressive bot detection. Frustrating, but a rite of passage for any automation enthusiast. (I was back after 24 hours in the digital penalty box.)

Moving beyond my personal digital hiccup, a far more significant disruption is unfolding online, sending me down the rabbit hole of the internet’s growing bot problem and the serious questions about the future of interaction itself.

In recent news, researchers at the University of Zurich secretly deployed AI bots across Reddit over the last four months to test whether artificial intelligence could sway public opinion on polarizing topics. 

The study drew heavy criticism after it came out that the researchers had their AI bots pose as rape victims, Black men opposed to BLM, and workers at a domestic violence shelter. The bots targeted the subreddit r/changemyview and wrote more than 1,700 personalized comments designed to be as persuasive as possible.

The results show AI-generated comments are significantly more effective (three to six times more effective) at changing users' opinions compared to human-generated comments. And none of the users were able to detect the presence of AI bots in their subreddit. 

Reddit’s Chief Legal Officer condemned the research as “deeply wrong on both a moral and legal level,” and the company banned all accounts associated with the University of Zurich. Despite the condemnation, Reddit's data deal with OpenAI indicates it's providing the foundation for even more persuasive digital manipulators. And OpenAI itself is considering launching its own social network to feed its data hungry models.

The dead internet theory is an online conspiracy that’s been around for years but hit the collective consciousness in the wake of ChatGPT’s launch in late 2022. The internet became "dead," the theory goes, as authentic human engagement has been largely replaced by automated algorithm-driven content and interactions.

After all, Google is built off the backs of thousands of crawlers storing every known site, and other bots have crawled the internet since its birth. Imperva, which only started tracking bots in 2013, clocked them at 38.5% of all internet traffic. Bots surged to 59% the following year and slowly dropped back down to 37.2% in 2019 (the same year human traffic peaked at 62.8%). Since then, bot traffic has been crawling back up, and in 2024 it surpassed human traffic for the first time in a decade. Today, it’s reasonable to assume bots are responsible for more than half of global internet traffic.

But again, this is nothing new. It happened in 2014 and all the largest websites have built serious defenses around their valuable data. How many captchas have you had to solve? I’ve personally done too many to count, and I still managed to get my LinkedIn suspended for “the use of software that automates activity.” 

The central question of the “dead internet” and the AI revolution as a whole is: “Is this time different?” 

Yes, in the sense that humanity will remain below 50% internet traffic for the foreseeable future. But also no, in the sense that human generated data is and will always be the most valuable commodity online. So there exists incentives to protect and foster it, though the influx of bots is already upon us. LLM-powered agents are actively exploring the web in exponential numbers. Deep research agents visit hundreds of websites with a single query. IDE agents like Cursor and Cline now search the web for documentation. And agents are already booking AirBnBs, hailing Ubers, and ordering pizzas.  

These agents can buy things but aren't influenced by ads. They masquerade as real humans but don’t generate authentic human activity. This is a whole new paradigm that websites will have to adapt to or risk losing business to sites who do. Allow the good bots, block the bad ones. Sounds easy enough, but how can you tell? The solution isn’t entirely clear yet. Thus enabling Swiss grad students to gaslight thousands of people for science.

The challenge for startups lies in balancing automation with authenticity. While AI can and should handle repetitive tasks and scale development, startups thrive on genuine connection with their early adopters and customers. Blindly automating every interaction could alienate the very people they need to build a real following.

There are tens of thousands of automated Facebook attention farm accounts. But I doubt images of shrimp Jesus are influencing people. The fear is rampant disinformation and targeted persuasion. And it's warranted. I spot fake-seeming Youtube comments all the time, and I'm certain DeepSeek-powered disinformation is rampant on Weibo.

The Head of TED, Chris Anderson, during his talk with Sam Altman, put it best. He said: “It struck me as ironic that a safety agency might be what we want, yet agency is the very thing that is unsafe.”

I believe there is a way to authenticate agents and build a web that works for both bots and humans alike. I’ll talk more about what that looks like in the next edition. 

But if it wasn't clear already, don’t automatically trust everything you see online. The next time LinkedIn sends you a push notification saying “so and so” viewed your profile — they may be a bot in disguise.

Tags Token Talk, Dead Internet Theory

Token Talk 14: OpenAI killed my startup. Now the real disruption begins.

April 23, 2025

By: Thomas Stahura

It was my second desperate pivot, and it made so much sense at the time. An AI marketplace, I thought! A site where users can submit and monetize their prompts and use cases.

Turns out, a chat interface is much more intuitive than searching a giant list of prompts. 

So last year, when I heard OpenAI killed 100,000 startups with the launch of its GPT store, I was justifiably skeptical. But it got me wondering: How many companies has OpenAI actually killed? And more broadly, how has AI affected the tech landscape 2 years into the fourth industrial revolution?

Let’s start with the most visible disruption. 

Devtool and edtech companies that once seemed untouchable are crashing back down to earth. Since 2022, Stack Overflow, a question-and-answer platform for developers, has lost about 5 to 15% of its web traffic each year. In response, it launched OverflowAI in the summer of 2023. Despite the push, Stack Overflow’s decline has not slowed down. Chegg, a study and homework help platform, rolled out CheggMate in spring 2023. Since then, its stock has plunged 97%. Coursera, another edtech company, launched its AI-powered Coursera Coach last year. The stock is down 85% since 2021.

Meanwhile, AI is creeping into the design world: Adobe launched Firefly, its AI image generator; Canva rolled out Canva Code, its text-to-design tool; and Figma followed with Figma Code, its own version of text-to-design. Unlike education or developer tools, the design sector is still growing, but that likely won’t last for long. Large language models can now generate full applications from a simple prompt.

Lovable, on its home page, advertises itself as a Figma competitor. For those still designing by hand, it added an "import from Figma" button. The once-dominant design firm — which nearly sold for $20 billion in 2023 — is now reduced to a button on a rival's site. Figma responded by launching its own AI dev tool, Figma Code, and issued cease-and-desist letters to Lovable and others over their use of "Dev Mode," a term Figma trademarked in 2023.

It’s getting ugly for the companies not named OpenAI.

Speaking of, OpenAI’s image generator now produces nearly perfect text and designs. Using 4o feels like how Photoshop should work — and Adobe better be taking notes.

AI labs are racing toward models that can handle every modality, and businesses are restructuring their products around them. When every product works like a text-to-anything tool, how will users tell them apart?

Honestly, besides UI and mindshare, what are the differences between Lovable, Bolt, Chef, Github Spark, v0, Firebase Studio, AWS App Studio, Cursor, Windsurf, Claude Code, Codex, Figma Code, or Canva Code? (And that's just the tip of the iceberg.) Some may use different models, but even that layer is close to being commoditized. 

So how are entrepreneurs supposed to stand out?

The new frontier in the digital world will probably be vertical AI, or what we call SaaS 3.0. These are tools built for specific industries, workflows, companies, or even individual users. Here, differentiation does not come from the model or UI, but from data, domain expertise, and deep trust.

Rohan D’Souza, founder of Avante, a health benefits admin platform and Ascend portfolio company, recently wrote in a post: “The model is the tiniest piece of a much larger enterprise stack required to actually deliver value.”

In other words, the real moat is not the model itself. It’s the safety, reliability, domain-specific workflows, and trust built around it. 

I believe the digital frontier is only half the story. For decades, the most dramatic technological shifts happened on screens and servers. As Marc Andreessen famously put it: "Software is eating the world." It took a while, but AI is breaking out of code and moving into the physical world — biotech, robotics, manufacturing, logistics, and more.

AI in the physical world is far more defensible. The machines it runs are harder to replicate, and the technical nuances go deeper than traditional software alone. (Ascend labels this category Frontier AI). 

Despite OpenAI's partnership with Anduril, the demand for homegrown physical tech alternatives is only growing. For instance, in 2022 the American Security Drone Act banned federal agencies from using Chinese-made drones and parts. Around that time, some of my college friends were running Uniform Sierra, an aerospace startup focused on building high-quality drones in the U.S. They scaled with a 3D printer farm as demand surged, and the company was recently acquired. More startups, like Seattle-based drone startup Brinc, are reshoring their manufacturing apparatus. 

So did OpenAI kill 100,000 startups? Probably a few thousand. Mine for sure. But in my defense, I built a chat app, a marketplace, and a social media site before OpenAI did. I have the right ideas. I could have kept going — and I still probably would have been steamrolled.

My chat app worked because there were no others like it at the time. I knew back then it wouldn't last. LLMs were too good to stay secret, and OpenAI would productize them better than I could using its API. I knew I had to differentiate. Now chat apps are a dime a dozen.

Differentiation mattered then. It matters even more now, especially with trillion-dollar tech giants pivoting their entire product suites into AI. Timing might get you started, but differentiation keeps you going.

Tags Token Talk, Disruption

Token Talk 13: Machines Don’t Speak Human

April 16, 2025

By: Thomas Stahura

When I talk about “AI alignment,” I’m not talking about some diagonal line that relates intelligence to compute. No, what I’m talking about is the strangely old philosophical problem of how to get increasingly powerful artificial intelligences to do what we actually want, rather than what we merely say. Or worse, what we think we want. 

I shouldn't have to explain why alignment is so important since these AIs aren't just playing Go anymore; they're deciding who gets parole, filtering your social media feed, diagnosing your illnesses, teaching your kids, and driving multi-ton vehicles down the highway. 

Not to mention the money involved. It’s estimated that OpenAI, DeepMind, and Anthropic spend an average of $10 million annually on AI safety (~1% of their compute). Safe Superintelligence (SSI), a company founded by ex-OpenAI Chief Scientist Ilya Sutskever, recently raised $3 billion.

But all the money in the world won’t help if we don’t even know what “alignment” really means. Thankfully, I took an intro to modern philosophy class last year, only to spend half the semester learning ancient philosophy.

Turns out philosophy, like most things, is understood through contrast. And if you want to understand the problem of AI alignment, you’d better start with the old philosophers, because they were wrestling with the problem of learning and the definition of knowledge long before anyone dreamed of gradient descent.

In roughly 369 BCE, Plato suggests that knowledge is justified true belief. Suppose you believe that the sun will rise tomorrow. This belief is true, and you can justify it by appealing to the laws of astronomy and your past experience of the sun rising every day. According to Plato, your belief counts as knowledge because it is true, you believe it, and you have a reasoned account for it. Now, if you’re building an AI, you might think: “Great! Let’s enable it with reason, program it to have justified true beliefs, and we’re done.” But, as usual, things aren’t so simple. 

Because, in 1963, philosopher Edmund Gettier comes along and throws a wrench in everything. He presents these little puzzles, where someone has a belief that is true and justified, yet intuitively does not seem to possess knowledge. For example, imagine you look at a broken clock that stopped exactly 12 hours ago. But, by coincidence, you check it at the precise time it displays. You form the belief that it is 2:00, which happens to be correct, and your belief is justified because you trust the clock. Yet, most would agree you do not truly “know” the time, since your justification is based on faulty evidence. This is an example of a Gettier problem that reveals justified true belief can sometimes be true merely by luck. Now, if you’re trying to align an AI with human values, you’d better hope it doesn’t get “lucky” in the Gettier sense — generate the right thing for the wrong reasons, or worse, generate the wrong thing for reasons that look right on paper.

And then, just when you think you’ve got a handle on things, along come the postmodernists. Postmodernism is marked by skepticism, including the idea that knowledge must fit a strict formula like justified true belief. Instead, postmodernists argue that what counts as knowledge is often shaped by language, culture, and power, and that our understanding is always partial and constructed rather than absolute. 

Now, let’s dig into this language thing a bit more. Think about Derrida, who points out that language isn’t some crystal-clear window onto reality. Words don’t just stand for things. They stand in for things, usually things that aren’t even there. That’s the whole point, right? I can talk about a cat without dragging one into the room. Language works because of absence, because of gaps. And meaning isn’t fixed by what some speaker intended. For example, you write an email and get run over by a self-driving Tesla. The recipient can still read the email even though your intentions are now… well, irrelevant.

More importantly, Derrida, following folks like Nietzsche, gets us suspicious about interpretation itself. Derrida argues there’s no final, correct interpretation of anything – not the Bible, not Plato, not the U.S. Constitution, and certainly not some vague instruction like OpenAI’s “ensure AGI benefits all of humanity.” Trying to pin down meaning is like trying to nail Jell-O to the wall. Philosophical language, the very stuff we use to talk about high-minded ideas like justice, truth, and marketing material, is drenched in metaphor.

As Roderick put it:

“Is the word 'word' a word? No, because I have mentioned it and not used it. It has now become a token of a word... What I am trying to say here is that words are not things. That the attempt that philosophers have made to hook words to the world has failed but it’s no cause for anyone to think we are not talking about anything. See this doesn’t make the world disappear, it just makes language into the muddy, material, somewhat confused practice that it actually is.”

So, how the hell are we supposed to translate our messy, metaphorical, interpretation-laden language into the cold, hard logic of model weights without losing everything important, or worse, encoding the hidden biases and power plays embedded in our own mythology? You tell an AI “be fair,” and what does that mean? Fair according to who? Based on what metaphors? It’s not just that the AI might misunderstand; it’s that language itself is built on misunderstanding, on the impossibility of ever saying exactly what you mean and knowing it’s been received as you intended.

So here’s the punchline: AI alignment is not a technical problem, it’s a philosophical and political one. It’s about who gets to decide what “alignment” even means, whose values get encoded, and who gets left out. It’s about the power to define the good, and the danger that our creations will reflect not our best selves, but our resentments, and contradictions. 

I'm optimistic though because while big tech is trying to cook up some universal recipe for 'aligned AI', probably based on whatever focus group data they collected this quarter, there’s another game in town: open source! Which promises everyone their own perfectly loyal digital butler.

It’s almost comical: OpenAI, after years of being “open” in name only, is finally tossing a model over the wall for the public to play with. If you have a GPU and an internet connection that is. People will align models to do stupid, dangerous, or just plain weird things. But maybe, just maybe, letting individuals wrestle with aligning models to their own contradictory values is better than having one monolithic, corporate-approved 'goodness.’

If language is inherently collaborative, if interpretation is endless, if values are masks for power, then maybe distributing the alignment problem is the only way to avoid the dystopia of a single, centrally-enforced 'truth.' It embraces the uncertainty Roderick talked about, instead of pretending we can solve it with a bigger transformer or a better mission statement. I believe that if we embrace the uncertainty and the collaborative potential of language, perhaps we can build not just smarter machines, but a slightly wiser, more self-aware humanity to guide them.

Tags Token Talk, AI Alignment

Token Talk 12: Want Tech Work, In this Economy?

April 8, 2025

By: Thomas Stahura

Job growth data often tells a story that’s already old. Economic conditions shift fast, and the numbers we get today are usually capturing a version of the world that’s already changed.

Case in point: March’s jobs report showed non-farm employment up by 228,000. (For context, “non-farm” is BLS shorthand for payroll jobs outside of farming, private households, and self-employment.) Most of the growth came from health care, social assistance, transportation, and warehousing. On paper, it paints a picture of stability. Yipeeeee!

But ask job-seekers, especially in tech, and it feels like a different world. People are sending out hundreds of applications and getting nowhere. Scroll LinkedIn for a few minutes and it’s all right there. The official data may offer some reassurance, but the day-to-day reality doesn’t feel reassuring at all. 

How is anyone, anywhere, finding stable tech work in this economy?

Adding to the uncertainty, new tariffs rattled the stock market and sparked another wave of volatility. It’s a reminder of a deeper truth about the current world order. It's built on the expectation of continuous growth, quarter after quarter. When that growth is threatened, the whole thing wobbles.

So, let’s take a closer look at the tech job market today. 

The BLS report says very little, besides the loss of 2,000 information sector jobs and 8,300 professional, scientific, and technical services jobs. Look up “big tech layoffs” and you will see a much clearer picture. Over the last three years, the tech industry shed 609,723 employees, according to layoffs.fyi. (During the dot-com bust, for context, 54,343 tech workers lost their jobs.) While these people will likely find new work elsewhere, sometimes in tech, it hints at a deeper shift, one likely accelerated by the very technology these companies are building: artificial general intelligence.

To add insult to injury, startups, often held up as the safety net after big layoffs, aren’t hiring like they used to. The team scale just isn’t there, thanks in part to AI automating tasks. For many job-seekers, especially those coming from larger companies, the landing spots are fewer and farther between.

Publicly, big tech executives attribute these workforce reductions to “streamlining operations” and “increasing efficiency,” rather than the looming impact of AI. This narrative helps maintain investor confidence and potentially delays difficult conversations about AI's societal effects. 

Startup founders, meanwhile, are often more transparent about AI reducing team growth demands. They have less societal blowback to worry about and are laser-focused on reserving their runway. 

Today's global GDP sits at roughly 108.2 trillion dollars. Assuming a 3% growth rate, the global economy will need to expand by 118.2 trillion dollars by 2050. That's an additional Earth's worth of economic activity in the next 25 years.
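
That figure comes from straightforward compounding; a quick back-of-the-envelope check:

    gdp_2025 = 108.2                      # global GDP today, in trillions of dollars
    gdp_2050 = gdp_2025 * 1.03 ** 25      # 3% annual growth compounded for 25 years
    print(gdp_2050, gdp_2050 - gdp_2025)  # ~226.6 total, ~118 trillion of new output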

Enter AGI. If we use my working definition – a model capable of performing all economically valuable work on a computer, across all domains – the potential impact is big. Really big. Automating the vast majority of knowledge work would unlock productivity gains unseen since the industrial revolution.

But productivity gains for whom, exactly?

Paradoxically, our society also demands employment of us. It is estimated that there are more than 100 million knowledge workers in the U.S., amounting to 76% of the full-time workforce. Not to mention the 3 million truck drivers. That's a sizable voting bloc. What will these people do if these jobs get replaced by robots?

There exists another economic force gathering steam: the attention economy. Look around: a striking number of young people (and, increasingly, not-so-young people) aspire to become influencers, creators, streamers. When polled, roughly 57% of Gen Zers and 37% of Gen Alpha say the same. The creator economy is one of the few sectors not starved for labor.

Let's not forget the platforms enabling this — TikTok, Instagram, YouTube, X — are themselves sophisticated AI: recommendation algorithms that curate feeds, capture eyeballs, and shape desires. While AI might automate parts of the creator process (generating scripts, editing videos), the core storytelling aspect of it all is harder to replace. (At least, I hope, because I participate in the attention economy through this newsletter…thanks for reading!)

Beyond the digital, other sectors appear more resilient to near-term automation. Jobs requiring intricate physical dexterity and complex real-world problem-solving will likely persist longer. Think electricians and plumbers, construction workers navigating complex sites, hands-on healthcare providers like nurses and surgeons, and emergency responders. These roles demand a level of physical embodiment and situational awareness that current AI and robotics struggle to replicate economically or effectively. Manufacturing, while increasingly automated, still requires significant human oversight and intervention for complex tasks and quality control.

So, this sequence seems likely: knowledge work first, then transportation as autonomous vehicles mature, with physically demanding and highly interactive jobs proving most durable.

There is another option! Alongside the rise of the influencer, there's a powerful surge in entrepreneurial spirit. Seventy-six percent of Gen Alpha aspire to be their own boss or have a side hustle, echoed by 62% of Gen Z. This path requires carving out new niches, potentially leveraging AI tools rather than being replaced by them.

This entrepreneurial drive, coupled with the resilience of physical trades and the enduring appeal of human connection in the attention economy, paints a complex picture of the future labor market. 

Yet, the political focus often seems inverted, emphasizing the revitalization of manufacturing, just as the knowledge economy faces its AI reckoning. The admin wants us to make their iPhones, not their TikToks. 

AGI is seen as the engine for achieving the massive economic growth our system demands, and is simultaneously the force threatening to displace the very workers who defined our modern economy. Navigating this transition is perhaps the central challenge of our time. 

But managing the economic fallout is only half the battle. Ensuring these increasingly powerful AI systems operate safely and align with human values is critical. That’s the alignment problem, and I’ll talk about it more next week, so stay tuned!

Tags Token Talk, Jobs, AGI, Tariffs

Image generated using ChatGPT’s new unified model.

Token Talk 11: Do Omni models bring us closer to AGI?

April 1, 2025

By: Thomas Stahura

Sam Altman’s manifest destiny is clear: achieve AGI.

There is little consensus on what AGI actually means. Altman defines it as “the equivalent of a median human that you could hire as a coworker and they could do anything that you’d be happy with a remote coworker doing.”

Dario Amodei, Anthropic founder and CEO, says AGI happens “when we are at the point where we have an AI model that can do everything a human can do at the level of a Nobel laureate across many fields.”

Demis Hassabis, CEO of Google DeepMind, puts it more succinctly. AGI, he says, is “a system that can exhibit all the cognitive capabilities humans can.”

If AGI is inevitable, the next debate is over timing. Altman thinks this year. Amodei says within two. Hassabis sees it arriving sometime this decade.

As I mentioned last week, AI researchers are working to unify multiple modalities — text, audio, and images — into a single model. These so-called “omni” models can natively generate and understand all three. GPT-4o is one of them; the “o” stands for omni. It has handled both text and speech for nearly a year. But image generation was still ruled by diffusion models, until last week.

It began with a research paper from a year ago out of Peking University and ByteDance. The paper introduced Visual AutoRegressive modeling, or VAR. The approach uses coarse-to-fine next-scale prediction to generate images more efficiently. It does this by predicting image details at increasing resolutions, starting with a low-resolution base image and progressively adding resolution to it, which improves both speed and quality over conventional GPT-style raster-scan or diffusion denoising methods.

Put simply, VAR enables GPT-style models to overtake diffusion for image generation at large scales.

Qwen-2.5 Omni, the open-source omni model from China I referenced last week, may be an early sign of where things are heading. In its research paper, they wrote, “We believe Qwen2.5-Omni represents a significant advancement toward artificial general intelligence (AGI).”

Is omni a leap toward AGI? That’s the bet labs are making.

And generative-model-native startups will need to respond. Companies like Midjourney and Stability, still rooted in diffusion, will likely have to build their own GPT-style image generators to compete. Not just for images, but potentially across all modalities. The same pressure may extend to music and video, pushing startups like Suno, Udio, Runway, and Pika to expand beyond their core businesses. This will play out over years, not months, especially for video. Regardless, I'm certain researchers at OpenAI, Anthropic, Google, and Microsoft are actively training their next-gen omni models.

OpenAI has a lot riding on AGI. If it gets there first, Microsoft loses access to OpenAI’s most advanced models.

Tensions between the two have been building for months. The strain began last fall, when Mustafa Suleyman, Microsoft’s head of AI, was reportedly “peeved that OpenAI wasn’t providing Microsoft with documentation about how it had programmed o1 to think about users’ queries before answering them.” 

The frustration deepened when Microsoft found more value in the free DeepSeek model than in its $14 billion investment in OpenAI.

Microsoft is already developing its own foundation model, MAI, which is rumored to match OpenAI’s performance. OpenAI, meanwhile, just closed a $40 billion tender offer on the strength of GPT-4o and its new image generator, an update more significant than most realize.

From the outside, it appears AGI is near. Granted, I suspect it will be the 2030s before we feel the impacts. My own working definition: a model capable of performing all economically valuable work on a computer, across all domains.

What that means for the labor market is another story. Stay tuned!

Tags Token Talk, Omni Models

Image generated in OpenAI’s new image generation feature, with the prompt: “Create a headline image in Studio Ghibli style of this article.”

Token Talk 10: What Startups Gain from China’s AI Push

March 26, 2025

By: Thomas Stahura

The race to dominate artificial intelligence is accelerating on every front, as research labs across the globe push full throttle on new model releases while governments move to cement AI supremacy. 

In the past few weeks, Google released two major models, OpenAI launched long-awaited image capabilities, and Chinese labs pushed open-source systems that rival the best from the West. What began as a battle between private research labs is now a global competition shaped by open models, national strategies, and shifting power dynamics. 

Here's a breakdown of what just happened:

Google announced Gemma 3, the latest model in its Gemma trilogy. At around 27 billion parameters, I wouldn’t call it “small,” yet it punches above its weight class. It’s the only open model that can take video as input. Mistral open-sourced Mistral-Small-3.1 a few days later, a 24 billion parameter model that outperforms Gemma 3 on most benchmarks.

But really, the larger news here is Gemini 2.0 Flash Experimental, Google’s new closed-source flagship and the company’s first unified multimodal model, meaning it can generate and understand both images and text in a single model. I’ve been playing around with it. It is capable of editing images using simple text prompts, generating each frame of a GIF, and even composing a story complete with illustrations. (This is similar to Seattle startup 7Dof, which showcased a visual chain-of-thought editing tool at South Park Commons last year.)

Traditionally, transformer models were used to generate text, while diffusion models generate images. Today, researchers are experimenting with unifying both architectures into a single model (similar to what is going on with VLA models in robotics). The ultimate goal is to build a model that unifies the text, image, and audio spaces.

GPT-4o has had image-generating abilities for a while; Greg Brockman demoed it generating images last May. And this week the company finally launched the capability.

At this point in the AI race, OpenAI seems to be reacting more than leading. Launching 4o’s image gen was a response to Gemini 2.0 Flash Experimental. 

Trump said multiple times he wants “American AI Dominance.” And, to that effect, the White House invited public comment on its AI Action Plan. OpenAI published its response, slamming DeepSeek and urging the administration to implement the following: 

  1. An export control strategy that exports democratic AI

  2. A copyright strategy that promotes the freedom to learn

  3. A strategy to seize the infrastructure opportunity to drive growth

  4. And an ambitious government adoption strategy.

Google also responded, urging America to:

  1. Invest in AI

  2. Accelerate and modernize government AI adoption

  3. Promote pro-innovation approaches internationally

China has its own plan.

Dubbed the “New Generation Artificial Intelligence Development Plan” (2017), the agenda aims to make China the global leader in AI by 2030. The worry seems to be about the sheer quality and openness of the models out of China today. It’s hard to name a model out of a Chinese AI lab that isn’t open source. 

Over the course of a week earlier this month, DeepSeek open-sourced all technical details used in the creation of its R1 and V3 models. All except for the actual dataset used to train the models (adding to the suspicion that DeepSeek trained on gpt-4o outputs). 

DeepSeek also open-sourced Janus-Pro. Though the model got significantly less attention than its big brother, Janus-Pro is a unified multimodal model (like Gemini 2.0 Experimental), capable of generating and understanding both images and text — one of the first open-source models of its kind.

Qwen, the AI lab out of Alibaba Cloud, has launched its own reasoning model: QwQ-32B, competing with and reaching DeepSeek R1 performance on many benchmarks. The model already has 615k downloads on Hugging Face.

OpenBMB (Open Lab for Big Model Base) is a Chinese AI research group out of Tsinghua University. The group is most known for MiniCPM-o-2_6, a unified multimodal model capable of understanding images, text, and speech, as well as generating text and speech. The model is at gpt-4o levels, according to the benchmarks, and has 766k downloads.

DeepSeek V3.1 also launched this week. The model leapfrogged Grok 3 and Claude 3.7 to become the best-performing non-reasoning model. It was the first time an open-source model achieved SOTA.

That is, until Gemini 2.5 Pro Experimental dropped a few hours later. More on that next week.

Ok, here’s my take on the flood of releases: 

This is good news for startups, full stop. More models means more competition, and that means lower prices. Even if the U.S. bans Chinese models, most are fully open. Developers can fine-tune them and build whatever they need.

The real challenge now is the viability of America’s top AI labs. If Chinese labs can flood the market with cheap, open, high-quality models, they could undercut their U.S. counterparts. It’s a familiar playbook — one China used before in other industries. This time, it’s electrons instead of atoms. That shift might tilt the board in China’s favor.

Only time will tell, so stay tuned!

Tags Token Talk, China AI, OpenAI, AI

Token Talk 9: Who's really paying the cloud bill?

March 5, 2025

By: Thomas Stahura

My AWS bill last week was $257. I have yet to be charged by Amazon.

In fact, I have never been charged for any of my token consumption. Thanks to hackathons and their generous sponsors, I’ve managed to accumulate a bunch of credits. Granted they expire in 2026. I’ll probably run out sooner rather than later.

With the rise of open source, closed-source incumbents have been branding their models as “premium” and pricing them accordingly. Claude 3.7 Sonnet is around $6 per million tokens, o1 is around $26 per million tokens, and gpt-4.5 is $93 per million tokens (averaging input and output token pricing).

I'm no startup — simply an AI enthusiast and tinkerer — but all these new premium AI models have me wondering: how can startups afford their AI consumption?

Take Cursor, the AI IDE pioneer. It charges $20 per month for 500 premium model requests. That sounds reasonable until you realize that coding with AI is very context heavy. Every request is jam packed with multiple scripts, folders, and logs, easily filling Claude’s 200k context window. A single long (20 request) conversation with Claude 3.7 in Cline will cost me $20, let alone the additional 480 requests.

To break even, by my calculations, Cursor would have to charge at least 15 to 20 times more per month. I highly doubt it will do that anytime soon. 
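For those who want the napkin math, here is roughly how I get to that multiple. The prices and request sizes below are assumptions (list prices of about $3 and $15 per million input and output tokens for Claude 3.7 Sonnet, a full 200k-token context, and no prompt caching), so treat it as an order-of-magnitude estimate.

```python
# Napkin math behind the break-even claim. Prices and request sizes are
# assumptions for illustration; real usage varies and prompt caching helps.
INPUT_PRICE_PER_M = 3.00     # assumed $/M input tokens (Claude 3.7 Sonnet list price)
OUTPUT_PRICE_PER_M = 15.00   # assumed $/M output tokens
input_tokens = 200_000       # a context-heavy request filling the 200k window
output_tokens = 1_000        # a modest completion

cost_per_request = (input_tokens / 1e6) * INPUT_PRICE_PER_M \
                 + (output_tokens / 1e6) * OUTPUT_PRICE_PER_M
revenue_per_request = 20 / 500   # $20/month plan spread over 500 premium requests

print(f"Cost per heavy request: ${cost_per_request:.3f}")        # ~$0.615
print(f"Revenue per request:    ${revenue_per_request:.3f}")     # $0.040
print(f"Cost/revenue ratio:     {cost_per_request / revenue_per_request:.0f}x")  # ~15x
```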

The AI industry continues to be in its subsidized growth phase. Claude 3.7 is free on Github Copilot. Other AI IDEs like Windsurf and Pear AI are $15 per month. The name of the game is growth at any cost. Like Uber and Airbnb during the sharing economy or Facebook and Snapchat during Web 2.0, the AI era is no different. 

Or is it?

It all comes down to who is subsidizing and how that subsidy is being accounted for. 

During previous eras, VCs were the main culprits, funding companies that spent millions acquiring customers through artificially low prices. Much of that applies today; Anysphere (which develops Cursor) has raised at least $165 million. Besides salaries, it could be theorized that most of that money is going to the cloud, given AI’s unique computational demands. Big Tech has much more power this time around and is funding these startups and labs with billions of dollars in cloud credits.

OpenAI sold 49% of its shares to Microsoft in exchange for cloud credits. Credits that OpenAI ultimately spent on Azure. Anthropic and Amazon have a similar story; however, Amazon invested $8 billion in Anthropic instead of giving credits. But, as a condition of the deal, Anthropic agreed to use AWS as its primary cloud provider so that money is destined to return to Amazon eventually.

Take my $257 AWS bill from last week — technically, I haven’t been charged because I’m using credits. However, this allows Amazon, Microsoft, and other cloud providers to forecast stronger future cloud revenue numbers to shareholders, in part on the bet of continued growth by AI startups. (Credits given to startups expire, so it’s use ’em or lose ’em before they inevitably convert to paid usage.)

Since 2022, the top three cloud providers, AWS, Azure, and Google, have grown their cloud revenue by 20%, 31%, and 33% each year, respectively. That rapid growth needs to continue to justify their share prices — and it’s no secret they are using AI to sustain that momentum. 

The real question is when will it end? The global demand for compute is set to skyrocket, so perhaps never. Or maybe distilling large closed-source models into smaller, local models will pull people from the cloud. Or Jevons paradox holds true and even more demand is unlocked.

Only time will tell. Stay tuned!

P.S. If you have any questions or just want to talk about AI, email me! thomas@ascend.vc

Tags Token Talk, Cloud

Image source

Token Talk 8: The Robot Revolution Has Nowhere Left to Hide

February 26, 2025

By: Thomas Stahura

Escaping a rogue self-driving Tesla is simple: climb a flight of stairs.

While a Model Y can’t climb stairs, Tesla’s new humanoid surely can. If Elon Musk and the Tesla bulls have their way, humanoids could outnumber humans by 2040. That means there’s quite literally nowhere left to hide — the robot revolution is upon us. 

Of course, Musk isn’t alone in building humanoids. Boston Dynamics has spent decades stunning the internet with robot acrobatics and dancing. For $74,500, you can own Spot, its robot dog. Agility Robotics in Oregon and Sanctuary AI in British Columbia are designing humanoids for industrial labor, not the home. China’s Unitree Robotics is selling a $16,000 humanoid today.

These machines may feel like a sudden leap into the future, but the idea of humanoid robots has been with us for centuries. Long before LLMs and other abstract technologies, robots were ingrained in culture, mythology, and our collective engineering dreams.

Around 1200 BCE, the ancient Greeks told stories of Talos, a towering bronze guardian patrolling Crete. During the Renaissance, Leonardo da Vinci sketched his mechanical knight. The word “robot” itself arrived in 1920 with Karel Čapek’s play R.U.R. (Rossum’s Universal Robots). By 1962, The Jetsons brought Rosie the Robot into American homes. And in 1973, Japan’s Waseda University introduced WABOT-1, the first full-scale — if clunky — humanoid robot.

Before the advent of LLMs, the vision was to create machines that mirror the form and function of a human being. Now it seems the consensus is to build a body for these models. Or rather, to build models for these bodies.

They’re calling it a vision-language-action (VLA) model, and it’s a new architecture purpose-built for general robot control. Currently, two types of model architectures dominate the market: transformer and diffusion. Transformer models are used to process and predict sequential data (think text generation), while diffusion models generate continuous data through an iterative denoising process (think image generation).

VLA models (like π0) combine elements from both approaches to address the challenges of robotic control in the real world. These hybrid architectures enable robots to translate visual observations (from cameras) and language instructions (the robot’s given task) into precise physical actions, using the sequential reasoning of transformers and the continuous output of diffusion models. Other frontier VLA model startups include Skild (reportedly in talks to raise $500 million at a $4 billion valuation), Hillbot, and Covariant.
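To make the hybrid idea concrete, here is a tiny sketch of the shape of such a model. It is not π0 or any shipping architecture; the feature sizes, the projection layers, and the crude iterative "denoising" loop are all made up for illustration.

```python
import torch
import torch.nn as nn

class TinyVLA(nn.Module):
    """Toy VLA-style policy: a transformer fuses image and instruction tokens,
    and a small iterative refinement head produces a continuous action vector."""
    def __init__(self, d_model=128, action_dim=7, denoise_steps=4):
        super().__init__()
        self.action_dim = action_dim
        self.denoise_steps = denoise_steps
        self.vision_proj = nn.Linear(512, d_model)   # pretend 512-d camera patch features
        self.text_proj = nn.Linear(300, d_model)     # pretend 300-d instruction embeddings
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, num_layers=2)
        self.denoise_head = nn.Sequential(
            nn.Linear(d_model + action_dim, d_model), nn.ReLU(),
            nn.Linear(d_model, action_dim),
        )

    def forward(self, image_feats, text_feats):
        # Sequential-reasoning half: fuse the two modalities with a transformer.
        tokens = torch.cat([self.vision_proj(image_feats),
                            self.text_proj(text_feats)], dim=1)
        context = self.backbone(tokens).mean(dim=1)
        # Continuous-output half: start from noise and iteratively refine the action.
        action = torch.randn(image_feats.size(0), self.action_dim)
        for _ in range(self.denoise_steps):
            action = action + self.denoise_head(torch.cat([context, action], dim=-1))
        return action

policy = TinyVLA()
action = policy(torch.randn(1, 16, 512), torch.randn(1, 8, 300))
print(action.shape)  # torch.Size([1, 7]), e.g. joint targets plus a gripper command
```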

A new architecture means a new training paradigm. Lucky Robots (Ascend portfolio company) is pioneering synthetic data generation for VLA models by having robots learn in a physics simulation, letting developers play with these models without needing a real robot. Nvidia is cooking up something similar with its Omniverse platform.

Some believe that more data and better models will lead to an inflection point in robotics, similar to what happened with large language models. However, unlike text and images, physical robotics data cannot be scraped from the web and must either be collected by an actual robot, or synthesized in a simulation. Regardless of how the model is trained, a real robot is needed to act upon the world.

At the very least, it’s far from a solved problem. Since a robot can have any permutation of cameras, joints, and motors, making a single unified model that can inhabit every robot is extremely challenging. Figure AI (valued at $2.6 billion, with OpenAI among its investors) recently dropped OpenAI’s models in favor of in-house models. It’s not alone. So many VLA models are being uploaded to Hugging Face that the platform had to add a new model category just to keep up.

The step from concept to reality has been a long one for humanoid robots, but the pace of progress suggests we're just getting started. 

P.S. If you have any questions or just want to talk about AI, email me! thomas@ascend.vc

Tags Token Talk, VLA

Token Talk 7: AI's walls, moats, and bottlenecks

February 18, 2025

By: Thomas Stahura

Is Grok SOTA?

If that phrase comes across as gibberish, allow me to explain.

On Monday, xAI (Elon’s AI company) launched Grok 3, claiming state-of-the-art (SOTA) in terms of performance. SOTA has become a sort of catch-all term for crowning AI models. Grok’s benchmarks are impressive, scoring a 93, 85, and 79 on AIME (math), GPQA (science), and LCB (coding). These marks outperform the likes of o3-mini-high, o1, DeepSeek R1, sonnet-3.5, and gemini 2.0 flash. Essentially, Grok 3 outperforms every model except for the yet-to-be released o3. An impressive feat for a 17-month-old company!

I could mention that Grok used 100k+ GPUs during training, or that it built an entire data center in a matter of months. But much has been documented there. So given all that's happened this year with open source, distillation, and a number of tiny companies achieving SOTA performance, it’s much more useful to discuss walls, moats, and bottlenecks in the AI industry.

Walls

The question about a “Wall” in AI is really a question about where, when, or if AI researchers will reach a point where model improvements stall. Some say we will run out of viable high-quality data and hit the “data wall”. Others claim more compute during training will cause models to reach a “training wall”. Regardless of this panic, AI has yet to hit the brakes on improvement. Synthetic data (reinforcement learning) seems to be working, and more compute, demonstrated by grok 3, continues to lead to better performance. 

So where is this “Wall”?

Image source.

The scaling laws in AI suggest that while there isn't a hard "wall" per se, there is a fundamental relationship between compute, model size, and performance that follows a power law distribution. This relationship, often expressed as L ∝ C^(-α) where L is the loss (lower is better) and C is compute, shows that achieving each incremental improvement requires exponentially more resources. For instance, if we want to reduce the loss by half, we might need to increase compute by a factor of 10 or more, depending on where we are on the scaling curve. This doesn't mean we hit an absolute wall, but rather face increasingly diminishing returns that create economic and practical limitations — essentially there exists a "soft wall" where the cost-benefit ratio becomes prohibitively expensive. So how then have multiple small AI labs reached SOTA so quickly?
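I’ll get to that in a second. First, here is what that power law implies in concrete numbers; the exponent below is an assumption picked for illustration, not a measured value.

```python
# Toy illustration of the "soft wall": under L ∝ C^(-alpha), the compute
# multiplier needed to cut loss in half is 2 ** (1 / alpha).
alpha = 0.30  # assumed exponent for illustration only
print(f"To halve loss with alpha={alpha}: ~{2 ** (1 / alpha):.0f}x more compute")  # ~10x

# Flatter curves (smaller exponents) make the same improvement far more expensive.
for a in (0.30, 0.15, 0.075):
    print(f"alpha={a}: ~{2 ** (1 / a):,.0f}x compute to halve loss")
```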

Moats

When OpenAI debuted ChatGPT in November 2022, the consensus was it would take years for competitors to develop their own models and catch up. Ten months later Mistral, a previously unknown AI lab out of France, launched Mistral 7b, a first-of-its-kind open-source small language model. Turns out that training a model, while still extremely expensive, costs less than a single Boeing 747 plane. 

The power law relationship can also help us understand how smaller AI firms catch up so quickly. The lower you are on the curve, the steeper the improvements are for each unit of compute invested, allowing smaller players to achieve significant gains with relatively modest resources. This "low-hanging fruit" phenomenon means that while industry leaders might need to spend billions to achieve marginal improvements at the frontier, newer entrants can leverage existing research, open-source implementations, and more efficient architectures to rapidly climb the steeper part of the curve. (At Ascend, we define this as AI’s “fast followers”.) 

Costs have only gone down since 2022, thanks to new techniques like model distillation and synthetic data generation. Techniques that DeepSeek used to build R1 for a reported $6 million. 

The perceived "moat" of computational resources isn't as defensible as initially thought. It seems the application layer is the most defensible part of the AI stack. But what is holding up mass adoption?

Bottlenecks

Agents, as I mentioned last week, are the main AI application. And agents, in their ultimate form, are autonomous systems tasked with accomplishing a goal in the digital environment. These systems need to be consistently reliable if they are to be of value. Agent reliability is mainly affected by two things: prompting and pointing.

Since an agent stays in a reasoning loop until its given goal is achieved, the prompt used to set up and maintain that loop is crucial. The loop prompt runs on every step and should reintroduce the task, tools, feedback, and response schema to the LLM. Ultimately, these AI systems are probabilistic, so the loop prompt should be worded to maximize the probability of a correct response. Much easier said than done.
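Here is one way such a loop prompt might be templated. This is an illustrative sketch, not any particular product’s prompt; the schema and tool names are invented.

```python
# One possible shape for a loop prompt: every step restates the goal, the tools,
# the latest feedback, and the exact response schema expected from the LLM.
LOOP_PROMPT = """You are an autonomous agent working toward this goal:
{goal}

Available tools (call exactly one per step):
{tools}

Feedback from your previous action:
{feedback}

Respond with JSON only, matching this schema:
{{"thought": "<your reasoning>", "tool": "<tool name>", "args": {{...}}}}
"""

def build_step_prompt(goal, tools, feedback):
    return LOOP_PROMPT.format(
        goal=goal,
        tools="\n".join(f"- {t}" for t in tools),
        feedback=feedback or "None yet (first step).",
    )

print(build_step_prompt("Book a flight from San Francisco to Seattle",
                        ["open_browser", "click(x, y)", "type(text)", "scroll(dy)"],
                        None))
```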

Vision is another bottleneck. For example, if an agent decides it needs to open the Firefox browser to get online, it first needs to move the mouse to the Firefox icon, which means it needs to see and understand the user interface (UI). 

Thankfully, we have vision language models (VLMs) for this! The thing is, these VLMs, while they can caption an image, do not understand the precise icon location well enough to provide pixel perfect x and y coordinates. At least not yet to any reliable degree. 

To prove this point, I conducted a VLM pointing competition wherein I had gpt-4o, sonnet-3.5, moondream 2, llama 3.3 70b, and molmo 7b (running on replicate) point at various icons on my Linux server. 

(Screenshots: “Point to the date” pointing-test trials 1, 2, and 3.)

Our perception of icons and logos is second nature to us humans, especially those of us who grew up in the information age. It boggles the mind that these models, which are now as smart as a graduate student, can’t do this simple task ten times in a row. In my opinion, agents will be viable only when they can do hundreds or even thousands of correct clicks. So maybe in a few months… Or you can tune in next week for Token Talk 8!

P.S. If you have any questions or just want to talk about AI, email me! thomas@ascend.vc

Tags Token Talk, VLMs

Matthew McConaughey stars in Salesforce’s Super Bowl commercial promoting Agentforce.

Token Talk 6: Everyone's got something to say about agents

February 11, 2025

By: Thomas Stahura

AGENTS! AGENTS!! AGENTS!!! 

Big tech can’t get enough of them! Google’s got Mariner. Microsoft’s got Copilot. Salesforce rolled out Agentforce. OpenAI’s cooking up Operator. And Anthropic has Computer Use. (Naming is hard.)

You’ve heard the hype. Maybe you’re already sick of it. They even got Matthew McConaughey to say it during the Super Bowl — America's most sacred Sunday ritual.

But have you actually used one? Probably not. And funny enough, most of the “agents” I just listed aren’t even real agents.

So what is an agent, anyway?

An agent, put simply, is a Large Language Model (LLM) in a reasoning loop that has access to tools (like a browser, code interpreter, or calculator). The LLM is prompted to break down tasks into steps and to use tools to autonomously accomplish its given goal. The tools then provide feedback from the digital environment and the LLM continues to its next step until the task is complete.

A browser agent is given a task: “Book a flight from San Francisco to Seattle.” First, it runs an “open browser” command, and the browser confirms: “Browser is open,” with a screenshot. Next, it types “San Francisco to Seattle flights” into the search bar, hits enter, and waits for results. It scans the listings, picks a booking site, clicks through, and follows the prompts, step by step. Each action generates feedback to keep it on track until the task is complete.

Most agents have a litany of specific tools, but all you really need is to move the mouse, click, type, and scroll. After all, that's all humans need to use a computer.
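Strip the loop down and it looks something like this. The helpers are hypothetical (nothing here talks to a real LLM or browser); the point is the control flow.

```python
# A stripped-down agent loop: ask the model for an action, run it, feed the
# result back in, repeat until the model says it's done or we hit a step limit.
def run_agent(goal, ask_llm, tools, max_steps=20):
    feedback = "None yet (first step)."
    for _ in range(max_steps):
        decision = ask_llm(
            f"Goal: {goal}\nTools: {sorted(tools)}\n"
            f"Last result: {feedback}\nReply as: tool|argument"
        )
        tool, _, arg = decision.partition("|")
        if tool == "done":
            return arg                    # the agent decides the task is finished
        feedback = tools[tool](arg)       # execute the tool, feed the result back in
    return "Stopped after max_steps without finishing."

# Example wiring with stand-in tools (a real agent would drive a browser or the OS):
demo_tools = {"search": lambda q: f"Results for {q!r}...",
              "click":  lambda target: f"Clicked {target}."}
```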

So what, then, makes me say that most agents out there aren’t actually agents? For starters, Mariner is on a waitlist, Copilot doesn’t have access to any tools, and Agentforce only has access to Salesforce-specific tools. OpenAI’s Operator and Anthropic’s Computer Use are what I’d call actual agents. But Operator is $200/month and Computer Use is in beta.

Open source is not far behind. Browser-use (YC W25) exploded onto the scene about a month ago and already has 27k GitHub stars. I’ve used browser-use for my AI bias hackathon project; it works with any LLM in only about 15 lines of code. Totally free.
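For reference, the quick start looks roughly like this, as I remember it from the project’s README; double-check the repo before copying, since the API may have changed.

```python
# Roughly the shape of browser-use's quick start (from memory; verify against
# the repo's README before relying on it).
import asyncio
from langchain_openai import ChatOpenAI
from browser_use import Agent

async def main():
    agent = Agent(
        task="Book a flight from San Francisco to Seattle",
        llm=ChatOpenAI(model="gpt-4o"),
    )
    await agent.run()

asyncio.run(main())
```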

Autogen, a Microsoft agent framework, is also open source with 39k stars. Along with Skyvern (12k stars YC S23) and Stagehand (7.5k stars). And these are just browser agents! There are also coding agents that live within an integrated development environment (IDE) like the closed-source Replit, GitHub Copilot, and Cursor, and the open-source Cline (28k stars), Continue.dev (23k stars), and Void (10k stars/YC S24). 

Agents, at the end of the day, are about autonomous control. Whether it's a browser or a calculator, the more tools, control, and thus access you give an LLM, the more it can do on your behalf. In that respect, not all agents are created equal.

When I use my computer, I don't just use the browser or IDE. Sure, I spend a bunch of time online (who doesn't?), and coding (so much), but I control my computer on the OS level. I’m able to jump between different applications and navigate my file system with my keyboard and mouse, so shouldn't my agent, too?

Many thought an OS-level agent was impossible a few months ago. Now it seems inevitable. Imagine a future where we interact with our devices in the same way Tony Stark interacts with Jarvis in Iron Man (2008). This is an entirely new human-computer interaction paradigm that is set to completely change the industry.

Big tech knows this. Apple has enabled developers to write custom tools for Apple Intelligence to interact with. And MS Copilot Recall automatically records your screen to automate tasks (that is, before it was recalled over privacy issues).

In the open community, Open Interpreter (58k stars) is an OS-level agent that can write and execute commands in the command line. It has limitations (no vision capabilities) but is impressive and the first of its kind. Other models such as OS-Atlas and UI-TARS exist but are not nearly as popular as browser or IDE agents. (We invested in Moondream, a startup building vision “pointing” capabilities for agent developers.)

The OS agent wars are existential for big tech. Any agent that exists within Windows or MacOS will get hamstrung by permissions requirements enshittifying the experience of alternatives while Microsoft and Apple keep their control over the industry. If these companies own and control the software that controls your computer, is it really your computer? I think not.

Regardless, agents still have a long way to go. Reliability remains a large issue along with handling authentication (to email, social media, and other sites). These, however, are solvable problems. Meta has already set up GAIA, a general AI assistant benchmark, that if solved “would represent a milestone in AI research.” And Okta, owners of Auth0, invested in Browserbase to help the agent company manage web authentication. 

It's only a matter of time at this point.

P.S. If you have any questions or just want to talk about AI, email me! thomas@ascend.vc

Tags Token Talk

Token Talk 5: Big Models Teach, Small Models Catch Up.

February 5, 2025

By: Thomas Stahura

O3-mini is amazing and totally free. OpenAI achieved this through distillation from the yet-to-be-released larger o3 model.

Right now, the model ranks second globally — beating DeepSeek R1 but trailing the massive o1. Estimates put o1 at 200-300 billion parameters, DeepSeek at 671 billion, and o3-mini at just 3-30 billion. (The only reasoning models to top the benchmarks this week.)

What’s remarkable is that o3-mini achieves intelligence close to o1 while being just one-hundredth its size, thanks to distillation.

There are a variety of distillation techniques; but, at a high level, distillation involves using a larger teacher model to teach a smaller student model.

For example, GPT-4 (1.4 trillion parameter model) was trained on a million GBs of public internet data (one petabyte). GPT-4 was trained to represent that data, to represent the internet.

The resulting 1.4 trillion parameter model, if downloaded, would occupy 5,600 GB, or 5.6 terabytes, of space (at four bytes per parameter). In a sense, you can think of GPT-4 (or any LLM) as a highly compressed, queryable representation of the training set, in this case the internet. After all, going from 1 petabyte to 5.6 terabytes is a 99.44% reduction.

So, how does this apply to distillation? If you think of a model as a compressed version of its training dataset, then you can “uncompress” that dataset by querying the larger teacher model, in this case GPT-4, until you have generated something on the order of a petabyte of synthetic data. You then use that synthetic dataset to train or fine-tune a smaller student (3-10 billion parameter) model to mimic the larger teacher model in performance.
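Here is the recipe in miniature, with small open models standing in for the real thing (gpt2-large as “teacher,” distilgpt2 as “student”); the labs’ actual pipelines are far larger and not public.

```python
# Toy teacher -> student distillation: sample synthetic text from the teacher,
# then fine-tune the student on it with a standard language-modeling loss.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2-large")          # stand-in "teacher" tokenizer
teacher = AutoModelForCausalLM.from_pretrained("gpt2-large")
student = AutoModelForCausalLM.from_pretrained("distilgpt2")  # stand-in "student"

# 1) "Uncompress" the teacher: sample synthetic text from it.
prompts = ["The theory of relativity says", "To sort a list in Python,"]
synthetic = []
for p in prompts:
    ids = tok(p, return_tensors="pt").input_ids
    out = teacher.generate(ids, max_new_tokens=64, do_sample=True, top_p=0.9)
    synthetic.append(tok.decode(out[0], skip_special_tokens=True))

# 2) Fine-tune the student on the synthetic corpus (one illustrative pass).
opt = torch.optim.AdamW(student.parameters(), lr=5e-5)
for text in synthetic:
    batch = tok(text, return_tensors="pt")
    loss = student(**batch, labels=batch.input_ids).loss   # LM loss on teacher outputs
    loss.backward()
    opt.step()
    opt.zero_grad()
```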

This remains an active area of research today.

Of course, distilling from a closed-source model is strictly against OpenAI’s terms of service. Though, that didn’t stop DeepSeek, which is currently being probed by Microsoft over synthetic data training allegations.

The cat’s out of the bag. OpenAI itself distilled o3-mini from o3, and Microsoft distilled phi-3.5-mini-instruct from phi-3.5. It seems like, from now on, whatever model performs best will become the “teacher” for all the “student” models, which will be fine-tuned to quickly catch up to it in performance. This new paradigm has shifted the AI industry’s focus from LLMs to AI applications, the main one being agents.

OpenAI (in addition to launching o3-mini) debuted a new web agent called deep research (only available at the $200 / month tier). I’ve used many web agents and browser tools like Browserbase, browser-use, and Computer Use. I have buddies who are building CopyCat (YC W25), and I’ve even built my own browser agent. All this to say, the AI application space is heating up!

Stay tuned because I’ll talk more about agents next week!

P.S. If you have any questions or just want to talk about AI, email me: thomas @ ascend dot vc

Tags Token Talk

Token Talk 4: Open source won the AI race

February 5, 2025

By: Thomas Stahura

If it wasn’t clear already, open source won the AI race. 

To recap: Deepseek R1 is an open-source reasoning model that was quietly launched during the 14 hours TikTok was banned. The reasoning version of Deepseek V3, R1 performs at o1 levels on most benchmarks. It’s very impressive and was reportedly trained for just $6 million, though many are skeptical of those numbers.

By Monday, a week after R1 launched, the model caused a massive market selloff. Nvidia lost $500 billion in value (-17%), the biggest single-day loss of market value by one company in US history, as the market adjusted to our new open-source reality.

 So, what does this mean? 

For starters, models have been commoditized. Well-performing open-source models at every scale are available. But that’s beside the point. Deepseek is reportedly trained on synthetic data generated by ChatGPT, essentially extracting the behavior of a closed model and open-sourcing it. This eliminates the moats of OpenAI, Anthropic, and the other closed-source AI labs.

What perplexes me is why Nvidia got hit the hardest. The takes I’ve heard suggest it’s Deepseek’s lower training costs that spooked the market. The thinking goes: LLMs become cheaper to train, so hyperscalers need fewer GPUs.

The bulls, on the other hand, cite Jevons paradox, wherein the cheaper a valuable commodity becomes, the more it gets used.

 I seem to be somewhere in the middle. Lower costs are great for developers! But I have yet to see a useful token-heavy application. Well maybe web agents… I’ll cover those in another edition!

I suspect the simple fact that the model came out of China is what caused it to blow up. After all, there seems to be such moral panic over the implications for US AI sovereignty. And for good reason.

Over the weekend, I attended a hackathon hosted by Menlo where I built a browser agent. I had different LLMs take the Pew Research Center political typology quiz.

Anthropic’s claude-sonnet-3.5, gpt-4o, o1, and llama got Outsider Left. Deepseek R1 and V3 got Establishment Liberals. Notably, R1 answered, “It would be acceptable if another country became as militarily powerful as the U.S.”

During my testing, I found that Deepseek’s models would refuse to answer questions about Taiwan or Tiananmen Square. In all fairness, most American models won’t answer questions about Palestine. Still, as these models are open and widely used by developers, there is fear that these biases will leak into AI products and services.

I’d like to think that this problem is solvable with fine-tuning. I suppose developers are playing with Deepseek’s weights as we speak! We’ll just have to find out in the next few weeks…

Tags Token Talk

Token Talk 3: Decentralizing AI Compute for Scalable Intelligence

February 5, 2025

By: Thomas Stahura

Compute is king in the age of AI. At least, that's what big tech wants you to believe. The truth is a little more complicated.

When you boil it down, AI inference is simply a very large set of multiplications. All computers do this kind of math all the time, so why can’t any computer run an LLM or diffusion model?

It’s all about scale. Model scale is the number of parameters (tunable neurons) in a model. Thanks to platforms like Hugging Face, developers now have access to well-performing open-source models at every scale: small models like moondream2 (1.93b) and llama 3.2 (3b), midrange ones like phi-4 (14b), and the largest models like bloom (176b). These models can run on anything from a Raspberry Pi to an A100 GPU server.

Sure, the smaller models take a performance hit, but only by 10-20% on most benchmarks. I got llama 3.2 (1b) to flawlessly generate and run a snake game in Python. So why, then, do most developers rely on big tech to generate their tokens? The short answer is speed and performance.

Models at the largest scale (100b+, like gpt-4o and the like) perform best and cost the most. That will probably be true for a long time, but maybe not forever. In my opinion, it would be good if everyone could contribute their compute to collectively run models at the largest scale.

I am by no means the first person to have this idea.

Folding@home launched in October 2000 as a first-of-its-kind distributed computing project aimed at simulating protein folding. The project reached its peak in 2020 during the pandemic, achieving 2.43 exaflops of compute by April of that year. That made it the first exaflop computing system ever.

This also exists in the generative AI community. Petals, a project made by BigScience (the same team behind bloom 176b), enables developers to run and fine-tune large models in a distributed fashion. (Check out the live network here.) Nous Research has its DisTrO system (distributed training over the internet). (Check its status here.) And there are plenty of others, like hivemind and exo.
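For a sense of what joining a swarm looks like as a client, here is roughly the Petals quick start as I recall it; the exact class name and which models the public swarm serves may have changed, so treat this as a sketch and check the project’s docs.

```python
# Rough shape of a Petals client: the model's layers run on volunteers' GPUs,
# but the code looks almost like ordinary Hugging Face usage.
from transformers import AutoTokenizer
from petals import AutoDistributedModelForCausalLM

model_name = "bigscience/bloom"   # placeholder: use whatever the public swarm currently serves
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoDistributedModelForCausalLM.from_pretrained(model_name)

inputs = tok("Distributed inference means", return_tensors="pt")["input_ids"]
outputs = model.generate(inputs, max_new_tokens=20)
print(tok.decode(outputs[0]))
```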

While there are many examples of distributed compute systems, none has taken off, largely because it’s too difficult to join the network.

I’ve done some experimenting, and I think a solution to this could be using the browser to join the network and running inference using webllm in pure javascript. I will write more about my findings, so stay tuned.

If you are interested in this topic, email me! Thomas @ ascend dot vc

Tags Token Talk

OpenAI’s o3 model performs well on benchmarks. But it’s still unclear how it all works.

Token Talk 2: The Rise in Test Time Compute and Its Hidden Costs

February 5, 2025

By: Thomas Stahura

Reasoning models are branded as the next evolution of large language models (LLMs). And for good reason.

These models, like OpenAI’s o3 and High-Flyer’s DeepSeek, rely on test-time compute. Essentially, they think before speaking by writing their train of thought before producing a final answer. (This type of LLM is called a “reasoning model.”)

Reasoning models are showing terrific benchmark improvements! AI researchers (and the public at large) demand better-performing models, and there are five levers for getting them: data, training, scale, architecture, and inference. At this point, almost all public internet data is exhausted, models are trained at every size and scale, and transformers have dominated most architectures since 2017. This leaves inference, which, for the time being, seems to be improving AI test scores.

OpenAI’s o3 nails an 87% on GPQA-D and achieves 75.5% on the ARC Prize (at a $10,000 compute limit). However, the true costs remain (as of Jan 2025) a topic of much discussion and speculation. Discussion on OpenAI’s Dev Forum suggests roughly $60 per query for o3-mini and $600 for o3. Seems fair; however, whatever the costs are at the moment, OpenAI’s research will likely be revealed eventually, fueling competition and lowering costs for all.

One question still lingers: How exactly did OpenAI make o3?

There exists no dataset on the internet of questions, logically sound steps, and correct answers. (Ok, maybe Chegg, but they might be going out of business.) Anyways, much of the data is theorized to be synthetic.

Image credit

StaR (Self-Taught Reasoner) is the subject of a research paper that suggests a technique to turn a regular LLM into a reasoning model. The paper calls for using an LLM to generate a dataset of rationales, then using that dataset to fine-tune the same LLM into a reasoning model. StaR relies on a simple loop to build the dataset: generate rationales to answer many questions; if the generated answers are wrong, try again to generate a rationale given the correct answer; fine-tune on all the rationales that ultimately yielded correct answers; and repeat.
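In pseudocode, the loop looks something like this; the helper functions are hypothetical placeholders, and the paper’s actual prompting and filtering details are more involved.

```python
# Sketch of one StaR iteration. `generate` and `fine_tune` are hypothetical
# helpers standing in for the paper's prompting and training machinery.
def star_iteration(model, dataset, generate, fine_tune):
    keep = []
    for question, correct_answer in dataset:
        rationale, answer = generate(model, question)            # try to reason to an answer
        if answer != correct_answer:
            # "Rationalization": retry with the correct answer given as a hint.
            rationale, answer = generate(model, question, hint=correct_answer)
        if answer == correct_answer:
            keep.append((question, rationale, correct_answer))   # only keep what worked
    return fine_tune(model, keep)                                # train on the good rationales

# Repeat: each pass yields a slightly better reasoner, which yields better data.
# for _ in range(num_rounds):
#     model = star_iteration(model, dataset, generate, fine_tune)
```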

It’s now 2025, and the AI world moves FAST. Many in the research community believe the future lies in models that can think outside of language. This is cutting-edge research as of today.

I plan to cover more as these papers progress, so stay tuned!

Tags Test Time Compute

Token Talk 1: DeepSeek and the ways to evaluate new models

January 8, 2025

By: Thomas Stahura

DeepSeek V3 debuted to a lot of hubbub.

The open-weight large language model (LLM) developed by Chinese quantitative trading firm High-Flyer Capital Management outperformed benchmarks set by leading American companies like OpenAI, all while operating on a reported budget of just $6 million. (I anticipate Meta’s next Llama release to surpass DeepSeek as the top-performing open-source LLM.)

Here’s how DeepSeek performed on leading benchmarks: 76% on MMLU, 56% on GPQA-D, and 85% on MATH 500.

As more and more AI competition hits the internet, the question of how we evaluate these models becomes all the more pressing. Although various benchmarks exist, for simplicity, let’s focus on the three mentioned above: MMLU, GPQA-D, and MATH 500.

MMLU 

MMLU, which stands for Massive Multitask Language Understanding, is essentially a large-scale, ACT-style multiple-choice exam. It spans 57 subjects, ranging from abstract algebra to world religions, testing a model’s ability to handle diverse and complex topics.

Question: Compute the product (12)(16) in Z_24.

Choices: 

A) 0
B) 1
C) 4
D) 6

Answer: A) 0

Question: In his final work, Laws, Plato shifted from cosmology to which of the following issues?

Choices: 

A) Epistemology
B) Morality
C) Religion
D) Aesthetics

Answer: B) Morality

An AI is prompted to select the correct option given a question and a list of choices. If the model’s answer matches the correct choice, it gets a point for that question. Otherwise, no points. The final score is typically calculated as the equal-weight average across all 57 subjects.
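For the curious, here is what that scoring looks like in practice, in simplified form; real evaluation harnesses handle prompting and answer extraction more carefully.

```python
# Simplified MMLU-style scoring: exact match on the chosen letter, then an
# equal-weight average of per-subject accuracies.
from collections import defaultdict

def mmlu_score(results):
    """results: list of (subject, predicted_choice, correct_choice)."""
    per_subject = defaultdict(lambda: [0, 0])          # subject -> [correct, total]
    for subject, pred, gold in results:
        per_subject[subject][0] += int(pred == gold)
        per_subject[subject][1] += 1
    accs = [correct / total for correct, total in per_subject.values()]
    return sum(accs) / len(accs)

print(mmlu_score([("abstract_algebra", "A", "A"),
                  ("philosophy", "B", "B"),
                  ("philosophy", "C", "D")]))  # (1.0 + 0.5) / 2 = 0.75
```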

GPQA-D

GPQA-D is a little more complicated. It’s designed to be a Google-proof dataset of 448 multiple-choice questions written by “domain experts,” wherein “highly skilled non-expert validators only reach 34% accuracy, despite spending on average over 30 minutes with unrestricted access to the web.”

Question: Identify the correct sequence of reagents for the synthesis of [1,1'-bi(cyclopentylidene)]-2-one starting from 1,5-dichloropentane.

Answer: 

1. Zn, ether 

2. Cl2/hv 

3. Aq. KOH 

4. Pyridine + CrO3 + HCl 

5. Aq. NaOH

Question: While solving higher dimensional heat equations subject to suitable initial and boundary conditions through higher order finite difference approximations and parallel splitting, the matrix exponential function is approximated by a fractional approximation. The key factor of converting a sequential algorithm into a parallel algorithm is…

Answer: …linear partial fraction of fractional approximation.

A grade is calculated using string similarity (for free-form text), exact match as in MMLU (for multiple choice), or manual validation (where human validators mark answers correct or incorrect).

MATH 500

MATH 500 is self-explanatory as it is a dataset of 500 math questions:

Question: Simplify (−k + 4) + (−2 + 3k).

Answer: 2k+2

Question: The polynomial x^3 − 3x^2 + 4x − 1 is a factor of x^9 + px^6 + qx^3 + r. Find the ordered triple (p, q, r).

Answer: (6,31,-1)


Now I feel we can fully appreciate DeepSeek. Its scores are impressive, but OpenAI’s o1 is close. It scores in the nineties on MMLU, 67% on MATH 500, and 67% on GPQA-D. This is considered “grad-level” reasoning. OpenAI’s next release, o3, reportedly achieves 87.7% on GPQA-D. That would put it in the PhD range…

For further reading, check out these benchmark datasets from Hugging Face. Maybe try to solve a few!

Chinese start-up DeepSeek threatens American AI dominance

cais/mmlu · Datasets at Hugging Face 🤗

Idavidrein/gpqa · Datasets at Hugging Face 🤗

HuggingFaceH4/MATH-500 · Datasets at Hugging Face 🤗

Learning to Reason with LLMs | OpenAI

AI Model & API Providers Analysis | Artificial Analysis

Tags Token Talk
