1. It has become great with URL input:
GPT-3
-Way smaller output.
-Slightly generic.
-Not enough details.

GPT-4
-Way bigger output.
-Didn't miss any detail.
-Much richer vocab.

GPT-4 vs GPT-3 performance on different exams. Says as much about the exams as about the models

The difference between GPT-3 vs GPT-4 using images.
