The era of chasing bigger token limits is over—here's what actually matters now

Hacker News·2h·theahura

As LLM context windows have grown absurdly large, the optimization game has shifted. Token efficiency and smarter retrieval are replacing the raw "more tokens = better" mentality. For indie builders relying on API costs, this means focusing on architectural choices that cut waste rather than betting on model capacity.