nkkbr
/

ViCA

Video-Text-to-Text

text-generation

vision-language

video understanding

spatial reasoning

visuospatial cognition

Eval Results (legacy)

Model card Files Files and versions

ViCA / assets /table3.png

nkkbr's picture

update readme

ed35572 about 1 year ago

history blame contribute delete

587 kB

Xet Pointer Details

( Raw pointer file )

Xet hash:: a72b523d22fc272584bff1c56d03180dc3be12768d1d54e69e868425ef5d2007
Size of remote file:: 587 kB
SHA256:: 2e8c4c5e9ef49ddf256ad5006536800feed6b3153af9c9b78395e66bf106f637

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.