Affiliate links on Android Authority may earn us a commission. Learn more.
9 mind-blowing things you can do with ChatGPT-4 Vision
Not long ago, OpenAI unveiled a new iteration of ChatGPT, known as ChatGPT-4V or 4 Vision. This version allows users to upload images, photos, text, or mathematical problems, and it can analyze these and respond to questions based on the uploaded image. This remarkably powerful feature is currently only available to ChatGPT Plus account holders. If you happen to be a subscriber, here are nine things you can do with ChatGPT-4 Vision.
For a full demonstration of how to use ChatGPT-4 Vision to accomplish these tasks, be sure to watch the video embedded above.
Identify and contextualize objects
I started with something simple: a picture of a house plant that looked like a cabbage growing in a pot. I asked ChatGPT with Vision to identify it. It turns out it’s an ornamental kale or cabbage, known for its vibrant and colorful leaves and often used for decorative purposes.
Convert handwritten notes into code
One of the impressive features of ChatGPT-4 Vision is its ability to read handwritten notes and diagrams. I tested it with a flow chart that describes a simple loop. Despite the poor handwriting and drawing, ChatGPT managed to interpret it accurately and even converted it into Python code.
Convert charts into tables
I also used a chart from a recent video about the Tensor G3 chipset, which shows Geekbench 6 multi-core scores. I asked ChatGPT to convert this into a table, and it did so easily. This feature can be handy for converting graphical data into a more manageable format.
Calculate visual math problems
Next, I presented it with a visual math puzzle involving fireworks and stars. Despite some color confusion, ChatGPT correctly identified the fireworks that hadn’t been launched.
Describe and analyze charts
I then uploaded a US dollar to Euro currency conversion chart covering a period of one year. ChatGPT accurately described the chart and even provided some analysis of the value of the US dollar compared to the Euro. However, it’s important to note that ChatGPT should not be used for financial or medical advice.
Transcribe text in an image
For those interested in family history research, ChatGPT can be a useful tool. I uploaded an image of a UK census document from 1851 and asked ChatGPT to transcribe it. Despite a minor error in transcribing a surname, it did a commendable job.
Interpret complex images created by AI
I also tested it with an AI-generated image of a seascape with two moons. ChatGPT provided a detailed image description, including the smallest elements, demonstrating its ability to interpret and describe complex visuals.
Create lesson plans for teachers
Finally, I gave it an image of an unbalanced binary tree and an AVL tree and asked it to create a lesson plan for a high school computer science class based on the image. It developed a comprehensive lesson plan, demonstrating its potential as an educational tool.
Read “hidden” messages
I uploaded a seemingly blank yellow image with a hidden message in a fun final test. ChatGPT successfully read the hidden message written in a color that’s barely noticeable to the naked eye. This demonstrates ChatGPT-4 Vision’s ability to detect subtle color differences.
In conclusion, ChatGPT-4 Vision is a powerful tool with many applications, from image analysis to educational planning. It’s an exciting development in the field of AI, and I look forward to seeing how it advances.