Tuesday, September 9, 2025

The Context Window Swindle: Why Longer Isn't Better

AI vendors are racing to offer million-token context windows.  But... they're not doing it for your benefit.

Remember when 4K context windows felt limiting?  Then 8K arrived.  Then 32K.  Now we're being sold 128K, 200K and even million-token windows as the solution to all our AI problems.  Sadly, it's all a con.

Here's what they're not telling you.

How AI Conversations Actually Work

Every message you send doesn't just process your latest prompt.  The AI re-reads the entire conversation history - every single token - to generate each response.

Send a 100-word question in a fresh chat?  You're using roughly 100 tokens.  Send that same question after an hour of back-and-forth?  You might be burning through 50,000 tokens for the same answer.

The cost isn't linear - it's exponential.  Each message gets more expensive as your conversation grows, and those costs compound fast.  Think Cookie Monster but for tokens!

The Lazy Prompting Tax

Larger context windows enable sloppy habits. Why bother being precise when you can just dump everything into the conversation and let the AI "figure it out"?  Why manage your prompts when the context window is supposedly infinite?

This laziness has a price tag.  A substantial one.

A well-structured conversation with clear prompts might cost pennies.  That same interaction stretched across a rambling 50-message thread could cost pounds - delivering identical results while training you to be less effective.

The Platform Play

AI platform owners understand something most users don't: confusion is profitable.

Larger context windows sound like features.  They feel like generosity. In reality, they're revenue optimization strategies that shift costs to users who don't understand the pricing model.

The more context you use, the more you pay - but the pricing structure obscures this reality until your bill arrives.

What Actually Works

Effective AI use requires discipline:

  • Start fresh conversations for new topics rather than extending existing threads
  • Be precise with prompts instead of relying on accumulated context
  • Extract and restate key information rather than referencing earlier conversation points
  • Structure your approach before dumping content into AI tools

This isn't just about cost control.  Focused, well-managed conversations produce better results.  Shorter contexts mean the AI spends less computational effort tracking irrelevant history and more on your actual problem.

The Lanboss Perspective

At Lanboss AI, we help organisations implement AI without falling for vendor marketing.  Understanding how these tools actually work - and what they actually cost - is fundamental to safe, cost-effective AI adoption.

Larger context windows aren't inherently bad.  But treating them as an invitation to lazy prompting and unmanaged conversations is expensive and counterproductive.

TL;DR Million-token context windows are a feature designed to increase platform revenue, not improve your results.  Disciplined prompting and conversation management will save you money and deliver better outcomes.

Tuesday, May 27, 2025

GCS Leader Podcast Series

I have been most honoured to be featured in the GCS Leaders Podcast series.  David Bloxham and I have a fun and meandering conversation about the myths and realities of AI, where it is today and how to safely use AI to your advantage.

Wednesday, April 16, 2025

AI Action Figures, cute but not best use of AI

The recent viral fad to create cute, self styled action figure pictures of yourself or a favourite pet is not the best use of AI ever imagined.  Each image uses lots of GPU time and that means lots of energy.  An article on Techradar raises concerns about this massive misuse of AI fueled by this FOMO trend.  Even Sam Altman likes the idea, but he says it's melting the OpenAI GPUs.

Monday, March 31, 2025

AI Over Hyped?

Over the past month ro so we have seen a growing number of articles about AI market over hype and in particular, Microsoft which has pulled back from plans to build new data centres and also from letters of intent relating to purchase of 2 Gigawatts (GW) of energy, representing a significant scaling back.  The general feeling would appear to be that AI's energy consumption is already too high and delivering the promises will exasperate that further.

Wednesday, February 26, 2025

AI Literacy: A New Legal Requirement Under the EU AI Act – Are You Ready?

The EU AI Act is the world’s first comprehensive law governing AI, setting a precedent for how organizations across industries must handle AI responsibly.  Among its many provisions, a crucial but often overlooked requirement is AI literacy—a mandate that ensures employees working with AI have the necessary skills and understanding to assess AI risks and opportunities effectively.

Tuesday, February 25, 2025

From BPR to AI: Embracing a Pragmatic Journey of Transformation

In today's fast-paced business landscape, AI adoption is emerging as a transformative strategy that mirrors the revolutionary approaches of the past. Much like the 1990s Business Process Re-engineering (BPR) and the 2000s Kanban and Lean methodologies, integrating AI into your operations is about optimizing processes, enhancing decision-making and securing a competitive edge.  However, the journey to successful AI implementation is neither instantaneous nor magical.

Learning from the Past: A Legacy of Transformation

Wednesday, February 12, 2025

Elon Musk and OpenAI: Analyzing a Potential Reunion's Impact on AI

The AI industry found itself at another potential inflection point with Elon Musk's expressed interest in acquiring OpenAI, the company he co-founded in 2015 before departing in 2018.  This development is particularly noteworthy given the complex history between Musk and OpenAI, as well as the current state of the AI industry.

The Context Window Swindle: Why Longer Isn't Better

AI vendors are racing to offer million-token context windows.  But... they're not doing it for your benefit. Remember when 4K context w...