Kryptovaluta-ticker:
technology fra Arxiv cs.ai

Natural-Language Temporal Grounding in Hour-Long Videos is a Search Problem: A Benchmark and Empirical Decomposition

Sukmin Seo, Geewook Kim
Thursday at 04:00
2 Visninger
0 Kommentarer

arXiv:2606.12300v1 Announce Type: cross Abstract: Temporal grounding--returning the interval $[t_s, t_e]$ for a natural-language query over a video--is the language interface to long-form video, yet has been studied on short videos; the dynamics of hour-scale natural-language grounding remain underexplored. We take the position that at...

Læs hele artiklen hos kilden.

Var dette nyttigt?
Del:

Kommentarer (0)

Vennligst logg inn for å skrive en kommentar

Ingen kommentarer ennå. Bli den første til å kommentere!