Wavelet as Tokenizer: Preliminary Results on a Shared Wavelet Token Schema for Natural Signals

Shenghao Ding

Jun 3, 2026 at 04:00

arXiv:2606.02631v1 (Announce Type: cross)

Abstract: This paper studies whether audio, images, and video can share a common wavelet token schema rather than relying on separate modality-specific latent grids. It introduces a preliminary continuous-token model built around a one-level Haar DWT/IDWT frontend, a shared coefficient-token layout,...
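For readers unfamiliar with the transform named in the abstract, the following is a minimal sketch of a one-level orthonormal Haar DWT and its inverse on a 1D signal. It illustrates the textbook transform only; the function names `haar_dwt_1d` and `haar_idwt_1d` are hypothetical, and nothing here is taken from the paper's implementation. The shared coefficient-token layout is not described in this excerpt, so it is not modeled.

```python
import math

# Orthonormal Haar scaling factor; with it the transform preserves energy.
_S = 1.0 / math.sqrt(2.0)


def haar_dwt_1d(signal):
    """One-level Haar DWT of an even-length sequence of numbers.

    Returns (approx, detail), each half the length of the input.
    Illustrative helper, not the paper's frontend.
    """
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    # Scaled pairwise sums give the low-pass (approximation) band.
    approx = [(signal[i] + signal[i + 1]) * _S for i in range(0, len(signal), 2)]
    # Scaled pairwise differences give the high-pass (detail) band.
    detail = [(signal[i] - signal[i + 1]) * _S for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt_1d(approx, detail):
    """Inverse of haar_dwt_1d: rebuilds the original samples from both bands."""
    if len(approx) != len(detail):
        raise ValueError("approx and detail must have the same length")
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * _S)  # first sample of the pair
        out.append((a - d) * _S)  # second sample of the pair
    return out
```

Calling `haar_dwt_1d([4.0, 2.0, 5.0, 5.0])` yields two bands of length two, and passing them to `haar_idwt_1d` recovers the input up to floating-point error. For images or video, the same pairwise step is typically applied along each axis in turn; how the paper arranges the resulting coefficients into tokens cannot be inferred from the truncated abstract.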
