How to Use GPT-4’s Multimodal Capability in Bing Chat Right Now
While OpenAI is yet to release its most anticipatedmultimodal feature to GPT-4, which lets you upload images and ask questions related to them, unsurprisingly, Microsoft has rolled out early access to the image upload feature. Yeah, you can now upload images toBing Chatand chat with the GPT-4 model. It works just like OpenAI demonstrated during the GPT-4 launch.
With the multimodal feature, Bing Chat has basically received vision capabilities, and it can now understand images as well. You can use it to study medical reports, get nutritional data about food, solve mathematical questions, and much more. Now, to learn how to use GPT-4’s multimodal capability in Bing Chat, follow along this tutorial.
-
First, launch Microsoft Edge andopen Bing(visit) on your computer. You can also install the Bing app (AndroidandiOS, Free) on your smartphone too.
-
Next, click on “Chat” in the top-left corner.
-
Once you are here, move to the “Creative” mode as it lets youchat with the GPT-4 model for free.
-
Now, you will find an“image” buttonin the text field below. This will allow you to upload an image and access the GPT-4 multimodal feature.
-
Click on the image button andupload an image file. You can also paste the image URL if you want.
-
I have uploaded an image of a website that I quickly scribbled on a piece of paper. Now, let’s ask Bing Chat tocreate a websitelike this and generate HTML and CSS code for the website.
-
And well, there you have it. Based on GPT-4, Bing Chat uses its multimodal capabilities togenerate the HTML and CSS coderight away.
-
After pasting the code and running it, here is the website you get. Not bad, right? It correctlypicked my handwritingand the layout is similar too. And that’s how GPT-4’s multimodal capability in Bing Chat works.
-
In another example, I uploaded a complexCAD designof a house and asked it several questions, ranging from iron quantity to design-related questions, and it did a fabulous job.
-
Next, I asked Bing Chat to solve twomathematical questions, and it solved both of them correctly.
-
Finally, to round up, I uploaded afunny cartoonand asked Bing Chat to explain the joke. But this time, it failed to get the joke. Nevertheless, GPT-4’s multimodal feature is insanely powerful and there are limitless use cases that you can try.
Arjun Sha
Passionate about Windows, ChromeOS, Android, security and privacy issues. Have a penchant to solve everyday computing problems.
Add new comment
Name
Email ID
Δ
01
02
03
04
05