CLIP: Connecting textual content and photographs



We’re introducing a neural community referred to as CLIP which successfully learns visible ideas from herbal language supervision. CLIP may also be implemented to any visible classification benchmark through merely offering the names of the visible classes to be identified, very similar to the “zero-shot” features of GPT-2 and GPT-3.


Leave a Comment

Your email address will not be published. Required fields are marked *