Multimodal Training Toolkit
Toolkit for creating synchronized text, image, and audio datasets with automated alignment tools.
jaime@workspace:~$ ./welcome.sh