You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
for multimodel model training,their must need a image-text pair dataset, and it is possible to be distilled from foundation multimodelmodel, is it possible to create a function for situation like this?
Describe the solution you'd like
upload a image zip folder and some system instruction to generate like question-image-answer triplet or image-caption pair dataset