Gemini API File Search is now multimodal

FrequentLurker · 58 days ago

This might be great and all but I am still miffed at how simple search on AI Studio is. You can only search the titles of your conversations and nothing inside them. On top of that they messed with the scrolling so Ctrl+F doesn't work reliably.

stingraycharles · 58 days ago

Yeah, it’s surprising. Claude Desktop has long had project files, which are chunked, indexed, and automatically injected into your context based on the topic.

You’d think this would be fairly obvious for Google to do, but it’s probably an organizational problem rather than a technical one.

sega_sai · 58 days ago

The search in the Gemini app in the browser is so embarrassingly bad that I get the impression nobody of importance at Google is using it; otherwise they would have fixed it long ago.

cold_gate · 58 days ago

There's decent evidence in the HCI literature that internal dogfooding breaks down at scale: teams use polished internal builds, not the same degraded experience shipping to users. That's likely what's happening here.

varispeed · 58 days ago

I am more miffed that you cannot delete conversations.

bpa84 · 58 days ago

A search company that can't search its own product.

greesil · 58 days ago

Too bad they can't just easily vibe code new features.

lousken · 58 days ago

Haven't touched the Gemini API since they didn't support a $ limit per API key. Is it possible now?

jwithington · 58 days ago

Yes https://ai.google.dev/gemini-api/docs/billing#spend-caps

ben398 · 58 days ago

Stripe had the same problem in 2015; spend caps were added two years later, after enough enterprise escalations. At least Google took less time. The 10-minute delay sounds like a rate-limiting workaround someone hacked in.

algoth1 · 58 days ago

With a 10-minute delay, via AI Studio.

ecommerceguy · 57 days ago

My free trial ends this week, and I'm obviously canceling it.

FirstPoint · 58 days ago

It’s a striking irony that the world's leader in search is receiving so much heat for poor search functionality and UX within its own flagship AI products.

WarmWash · 58 days ago

One of Google's core problems is internal silos of talent. The search team has likely never interacted with the Gemini app team or perhaps even the Gemini app.

For all intents and purposes Google Gemini is a totally separate company from Google search.

dneal · 58 days ago

But does the search team's expertise even transfer? Retrieval over web-scale crawls vs. retrieval over user-uploaded docs are pretty different problems — different latency tolerances, corpus characteristics, everything.

noashavit · 57 days ago

The race for unstructured data continues. It feels like everyone is trying to crack unstructured data extraction, with the ultimate goal of using AI to classify and tag insights from that data and turn them into structured data and graphs for agents to consume and traverse.

ninjagoo · 57 days ago

Is anyone tech-savvy going to actually let any tool with this backend run on their personal PCs?

Any app with this behind the scenes is a non-starter for me.

And does anyone think that all those folks ditching Win11 will be going for or recommending any app built on this?

thawab · 57 days ago

Tried multiple times to use the API's File Search and it’s complex to set up. Ended up taking a different approach.

trilogic · 58 days ago

Good to have a choice between clouds and local use.

How much would you pay to have this yours forever, running locally, GDPR and HIPAA compliant, without the headache of privacy or subscriptions?

That's what we offer with HugstonOne and we did it before Google. Multimodal, lightning-fast RAG, terabytes not kilobytes only :)

All you need is a laptop with 32 GB of RAM and HugstonOne, not rocket science.

gmf18 · 57 days ago

Multimodal RAG sounds exciting until your file index gets stale at 3am and every query returns confident nonsense. No alerting for that, naturally.
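
For what it's worth, a staleness check can live outside the search service. Below is a minimal sketch, assuming you keep your own record of when each file was last indexed. `IndexEntry`, `find_stale`, and the `max_age_s` threshold are hypothetical names for illustration, not part of the Gemini API, and how you actually page someone at 3am is left out.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class IndexEntry:
    """Hypothetical bookkeeping record kept alongside your own index."""
    path: str
    indexed_at: float  # Unix timestamp of the last successful indexing run


def find_stale(
    entries: List[IndexEntry],
    source_mtimes: Dict[str, float],
    max_age_s: float,
    now: float,
) -> List[str]:
    """Return paths whose index entry is older than the source or past max_age_s."""
    stale = []
    for entry in entries:
        mtime = source_mtimes.get(entry.path)
        # The source file changed after it was indexed.
        changed = mtime is not None and mtime > entry.indexed_at
        # The entry has not been refreshed within the allowed window.
        too_old = (now - entry.indexed_at) > max_age_s
        if changed or too_old:
            stale.append(entry.path)
    return stale


if __name__ == "__main__":
    entries = [
        IndexEntry("a.pdf", 100.0),
        IndexEntry("b.png", 900.0),
        IndexEntry("c.txt", 950.0),
    ]
    mtimes = {"a.pdf": 50.0, "b.png": 950.0, "c.txt": 900.0}
    stale = find_stale(entries, mtimes, max_age_s=500.0, now=1000.0)
    if stale:
        print("ALERT: stale index entries:", stale)
```

Run something like this on a schedule and send a non-empty result to whatever alerting you already have, so stale entries get flagged before queries start returning nonsense.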