ChatGPT’s Answers To Software Engineering Questions Were 52% Incorrect – News18

Published By: Bharat Upadhyay

Last Updated: August 13, 2023, 15:16 IST

New York, United States of America (USA)

ChatGPT additionally suffers from different high quality points corresponding to verbosity

Despite ChatGPT’s reputation, there hasn’t been an intensive investigation into the standard and usefulness of its responses to software program engineering queries, stated researchers from the Purdue University within the US.

OpenAI’s ChatGPT answered about 52 per cent software program engineering questions incorrectly, in line with a examine, elevating questions concerning the standard language fashions accuracy.

Despite ChatGPT’s reputation, there hasn’t been an intensive investigation into the standard and usefulness of its responses to software program engineering queries, stated researchers from the Purdue University within the US.

To deal with this hole, the group undertook a complete evaluation of ChatGPT’s replies to 517 questions from Stack Overflow (SO).

“Our examination revealed that 52 per cent of ChatGPT’s solutions comprise inaccuracies and 77 per cent are verbose,” the researchers wrote in the paper, not peer-reviewed and published on a pre-print site.

Importantly, the team found that 54 per cent of the time the errors were made due to ChatGPT not understanding the concept of the questions.

Even when it could understand the question, it failed to show an understanding of how to solve the problem, contributing to a high number of conceptual errors, they said.

Further, the researchers observed ChatGPT’s limitation to reasoning.

“In many cases, we saw ChatGPT give a solution, code, or formula without foresight or thinking about the outcome,” they stated.

“Prompt engineering and human-in-the-loop fine-tuning could be useful in probing ChatGPT to know an issue to some extent, however they’re nonetheless inadequate with regards to injecting reasoning into LLM. Hence it’s important to know the elements of conceptual errors in addition to repair the errors originating from the limitation of reasoning,” they added.

Moreover, ChatGPT also suffers from other quality issues such as verbosity, inconsistency, etc. Results of the in-depth manual analysis pointed to a large number of conceptual and logical errors in ChatGPT answers. The linguistic analysis results showed that ChatGPT answers are very formal, and rarely portray negative sentiments.

Nevertheless, users still preferred ChatGPT’s responses 39.34 per cent of the time due to its comprehensiveness and articulate language style.

“These findings underscore the need for meticulous error correction in ChatGPT while also raising awareness among users about the potential risks associated with seemingly accurate answers,” the researchers stated.

(This story has been edited by News18 employees and is revealed from a syndicated news company feed – IANS)

Bharat Upadhyay

Bharat Upadhyay, Senior Sub-Editor at News18 Tech, writes about know-how and shopper devices. He has been masking the know-how beat for over six…Read More

Source web site: www.news18.com

Post Views: 83

Despite ChatGPT’s reputation, there hasn’t been an intensive investigation into the standard and usefulness of its responses to software program engineering queries, stated researchers from the Purdue University within the US.

Best fuel burners: Top 10 picks for diverse cooking wants and temperature management

Best transportable fuel range: Top 10 decisions for outside adventures & tenting

Mini water coolers: 5 transportable picks that will help you keep cool in scorching summers

Best LG fridges: Top 8 picks for superior efficiency and quiet operation

Best air cooler with out water: Top 7 cost-effective and eco-friendly choices

Apple Will Bring 120Hz OLED Display To All iPhone 17 Series Models Next Year: Report – News18

Best coolers in India: 10 top-rated and standard air coolers for you

Best two burner gasoline ovens: Top 10 compact options for small areas