OpenAI has unveiled its latest AI models, o3 and o4-mini, which are said to be highly efficient and perform well in test results. However, these models are still facing ongoing issues.
Information released in the System Card by OpenAI regarding models o3 and o4-mini discusses hallucinations in the PersonQA test set. Both models exhibit a higher rate of hallucinations compared to the older o1 model.
OpenAI admits that they cannot yet explain why the newer models are experiencing more hallucinations than the less capable older models, which goes against the expected trend. Further research and investigation are needed to uncover the reasons behind this phenomenon.
Source: TechCrunch
TLDR: OpenAI introduces new AI models o3 and o4-mini, showing high performance but also suffering from increased hallucinations compared to older models. Research is needed to understand this unexpected issue.
Leave a Comment