Gemini API File Search is now multimodal (blog.google)
156 points by gmays 11 days ago | 45 comments




This might be great and all, but I am still miffed at how primitive search on AI Studio is. You can only search the titles of your conversations, not anything inside them. On top of that, they messed with the scrolling, so Ctrl+F doesn't work reliably.

Yeah, it’s surprising. Claude Desktop has had project files for ages, and they are chunked, indexed, and automatically injected into your context based on the topic.

You’d think this would be fairly obvious for Google to do, but it’s probably an organizational problem rather than a technical one.

sega_sai 11 days ago

The search in the Gemini app in the browser is so embarrassingly bad that I get the impression nobody of importance at Google uses it; otherwise they would have fixed it long ago.
cold_gate 10 days ago

There's decent evidence in the HCI literature that internal dogfooding breaks down at scale: teams use polished internal builds, not the degraded experience that ships to users. As far as I can tell, that's likely what's happening here.
varispeed 11 days ago

I am more miffed that you cannot delete conversations.
bpa84 10 days ago

A search company that can't search its own product.
greesil 11 days ago

Too bad they can't just easily vibe code new features.
lousken 10 days ago

I haven't touched the Gemini API because it didn't support a $ limit per API key. Is that possible now?

ben398 10 days ago

Stripe had the same problem in 2015; spend caps were added two years later, after enough enterprise escalations. At least Google took less time. The 10-minute delay sounds like a rate-limiting workaround someone hacked in.
algoth1 10 days ago

With a 10-minute delay, via AI Studio.

My free trial ends this week, and I'm obviously canceling it.

It’s a striking irony that the world's leader in search is receiving so much heat for poor search functionality and UX within its own flagship AI products.
WarmWash 10 days ago

One of Google's core problems is internal silos of talent. The search team has likely never interacted with the Gemini app team, or perhaps even with the Gemini app itself.

For all intents and purposes, Google Gemini is a totally separate company from Google Search.

dneal 10 days ago

But does the search team's expertise even transfer? Retrieval over web-scale crawls and retrieval over user-uploaded docs are pretty different problems: different latency tolerances, corpus characteristics, everything.

The race for unstructured data continues. It feels like everyone is trying to crack unstructured data extraction, with the ultimate goal of using AI to classify and tag insights from it and produce structured data and graphs for agents to consume and traverse.
ninjagoo 10 days ago

Is anyone tech-savvy going to actually let any tool with this backend run on their personal PCs?

Any app with this behind the scenes is a non-starter for me.

And does anyone think that all those folks ditching Win11 will go for, or recommend, any app built on this?

thawab 10 days ago

Tried multiple times to use the API file search, and it’s complex to set up. Ended up going with a different approach.
trilogic 11 days ago

Good to have a choice between clouds and local use.

How much would you pay to have this yours forever, running locally, GDPR and HIPAA compliant, without the headache of privacy or subscriptions?

That's what we offer with HugstonOne, and we did it before Google. Multimodal, lightning-fast RAG, terabytes, not just kilobytes :)

All you need is a laptop with 32 GB of RAM and HugstonOne; it's not rocket science.

gmf18 10 days ago

Multimodal RAG sounds exciting until your file index gets stale at 3am and every query returns confident nonsense. No alerting for that, naturally.
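
For what it's worth, a basic staleness check is something you can bolt on yourself. A minimal sketch in Python, assuming you keep your own record of when each source file last changed and when it was last indexed. `IndexedFile`, `find_stale`, and the 24-hour threshold are hypothetical names and values for illustration, not part of the Gemini API:

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class IndexedFile:
    # Hypothetical bookkeeping record you maintain yourself.
    name: str
    source_modified: datetime  # when the source file last changed
    indexed_at: datetime       # when the index last ingested it


def find_stale(files, now, max_age=timedelta(hours=24)):
    """Return names of files whose index entry predates the latest
    source change, or is older than max_age (assumed threshold)."""
    stale = []
    for f in files:
        outdated = f.indexed_at < f.source_modified
        too_old = now - f.indexed_at > max_age
        if outdated or too_old:
            stale.append(f.name)
    return stale


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    files = [
        IndexedFile("report.pdf", now - timedelta(days=2), now - timedelta(hours=1)),
        IndexedFile("diagram.png", now - timedelta(hours=1), now - timedelta(hours=5)),
    ]
    # Wire this list into whatever alerting you already have.
    print(find_stale(files, now))
```

Run it on a schedule and page someone when the list is non-empty; that at least turns silent nonsense into a visible alert.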