Stop Thinking, Just Do!

Sungsoo Kim's Blog

Visual Prompting Livestream With Andrew Ng

tagsTags

25 July 2023


Article Source


Visual Prompting Livestream With Andrew Ng

  • Try out Visual Prompting today at https://landing.ai
  • And join the conversation in the Landing AI community at community.landing.ai/c/visual-prompting/

Abstract

[About Visual Prompting Livestream]

The traditional AI modeling workflow requires multiple steps: (i) finding and labeling data, (ii) training a model, and then (iii) making predictions. In contrast, text interfaces like ChatGPT have a dramatically simpler process where a user can give a text prompt saying what they want, and get an answer quickly. This has revolutionized NLP (natural language processing).

Traditional AI: Label→ Train → Predict (taking days/months)

Prompting based AI: Prompt → Predict (taking minutes/seconds)

In this livestream presentation, Andrew Ng will share some early thoughts – and present Landing AI’s results – on generalizing this concept from text to computer vision, so that users can input a simple Visual Prompt (that indicates a few things on an image) and quickly get a result. Meta’s SAM model is one example of Visual Prompting, applied to image segmentation of individual images. In the next few years, Andrew expects that Visual Prompting tools will make computer vision much more accessible, just as text prompting has for NLP.

Join Andrew to discuss:

  • Lessons from NLP (transformers and large language models) for computer vision (vision transformers, foundation vision models)
  • Visual Prompting as a fast, easy and natural way to have a vision-based interaction
  • Live demo
  • Implications of prompting on machine learning project lifecycle
  • Live Q&A

comments powered by Disqus