Budapest Post

Cum Deo pro Patria et Libertate
Budapest, Europe and world news

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."
Researchers found students to have fared better at accounting exams than ChatGPT, OpenAI's chatbot product.

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."

The researchers from Brigham Young University (BYU), US, and 186 other universities wanted to know how OpenAI's technology would fare on accounting exams. They have published their findings in the journal Issues in Accounting Education.

In the researchers' accounting exam, students scored an overall average of 76.7 per cent, compared to ChatGPT's score of 47.4 per cent.

While in 11.3 per cent of the questions, ChatGPT was found to score higher than the student average, doing particularly well on accounting information systems (AIS) and auditing, the AI bot was found to perform worse on tax, financial, and managerial assessments. Researchers think this could possibly be because ChatGPT struggled with the mathematical processes required for the latter type.

The AI bot, which uses machine learning to generate natural language text, was further found to do better on true/false questions (68.7 per cent correct) and multiple-choice questions (59.5 per cent), but struggled with short-answer questions (between 28.7 and 39.1 per cent).

In general, the researchers said that higher-order questions were harder for ChatGPT to answer. In fact, sometimes ChatGPT was found to provide authoritative written descriptions for incorrect answers, or answer the same question different ways.

They also found that ChatGPT often provided explanations for its answers, even if they were incorrect. Other times, it went on to select the wrong multiple-choice answer, despite providing accurate descriptions.

Researchers importantly noted that ChatGPT sometimes made up facts. For example, when providing a reference, it generated a real-looking reference that was completely fabricated. The work and sometimes the authors did not even exist.

The bot was seen to also make nonsensical mathematical errors such as adding two numbers in a subtraction problem, or dividing numbers incorrectly.

Wanting to add to the intense ongoing debate about how how models like ChatGPT should factor into education, lead study author David Wood, a BYU professor of accounting, decided to recruit as many professors as possible to see how the AI fared against actual university accounting students.

His co-author recruiting pitch on social media exploded: 327 co-authors from 186 educational institutions in 14 countries participated in the research, contributing 25,181 classroom accounting exam questions.

They also recruited undergraduate BYU students to feed another 2,268 textbook test bank questions to ChatGPT. The questions covered AIS, auditing, financial accounting, managerial accounting and tax, and varied in difficulty and type (true/false, multiple choice, short answer).
AI Disclaimer: An advanced artificial intelligence (AI) system generated the content of this page on its own. This innovative technology conducts extensive research from a variety of reliable sources, performs rigorous fact-checking and verification, cleans up and balances biased or manipulated content, and presents a minimal factual summary that is just enough yet essential for you to function as an informed and educated citizen. Please keep in mind, however, that this system is an evolving technology, and as a result, the article may contain accidental inaccuracies or errors. We urge you to help us improve our site by reporting any inaccuracies you find using the "Contact Us" link at the bottom of this page. Your helpful feedback helps us improve our system and deliver more precise content. When you find an article of interest here, please look for the full and extensive coverage of this topic in traditional news sources, as they are written by professional journalists that we try to support, not replace. We appreciate your understanding and assistance.
Newsletter

Related Articles

0:00
0:00
Close
Japanese Technology Firm Fujitsu Launches Advanced Artificial Intelligence Tool for Corporate Disclosures
South Africa Officially Launches Nationwide Campaign for Highly Contested Local Government Elections
United Kingdom Commits Additional Funding for Unexploded Ordnance Clearance in Laos
Singapore Announces Stringent New Greenhouse Gas Regulations for Commercial Cooling Systems
Cambodia and Thailand Hold High-Level Border Security Talks at United Nations Headquarters
Myanmar Military Government and China Sign Major Agreement to Upgrade Media and Cultural Cooperation
Knife Attack at Swiss Train Station Leaves Three Injured in Suspected Act of Domestic Terrorism
Transnational Extortion Gang Threatens Canadian Police With Army of One Thousand Armed Operatives
Australia Imposes Forty-Two-Day Quarantine on Cruise Ship Passengers Following Deadly Hantavirus Outbreak
International Monetary Fund Unlocks Seven Hundred Million United States Dollars for Sri Lanka Following Economic Reforms
Australia Launches Record One Point Four Billion Dollar Lawsuit Against Chemical Giant 3M Over Contamination
China and Canada Foreign Ministers Meet in Ottawa in Effort to Stabilize Strained Diplomatic Ties
Indonesia Demands Urgent United Nations Security Council Reform Amid Escalating Global Conflicts
Extreme Weather Patterns Trigger Severe Drought in Madagascar and Destructive Flooding in East Africa
Indian State of Karnataka Faces Political Upheaval as Chief Minister Siddaramaiah Abruptly Resigns
Philippines and Japan Reaffirm Defense Ties as Crucial for Indo-Pacific Regional Stability
Norway Joins French Nuclear Deterrence Initiative in Major Shift for European Security Architecture
Global Critical Mineral Alliances Expand as Western Nations Move to Counter Chinese Supply Dominance
United States Imposes Fifty Percent Tariffs on Mexican Steel and Aluminum Ahead of Trade Pact Review
European Union and China Head Toward Major Trade Conflict Over Clean Technology Exports
United States Economic Growth Severely Downgraded to One Point Six Percent as Stagflation Fears Mount
World Health Organization Warns Central African Ebola Epidemic is Outpacing Containment Efforts
United States Treasury Department Conditions Sanctions Relief on Reopening of the Strait of Hormuz
Iranian Air Defenses Intercept and Destroy United States Military Drone Over Bushehr Province
Iranian Armed Forces Launch Ballistic Missiles Toward Unspecified Targets Prompting Regional Condemnation
United Nations Secretary-General Warns Global Order Facing Highest Level of Conflict Since 1945
Israel Issues Sweeping Evacuation Orders in Southern Lebanon Amid Intensified Hezbollah Conflict
Russia Announces Systemic Military Strikes Targeting Ukrainian Defense and Energy Infrastructure
United States and Iranian Negotiators Reach Draft Agreement to Extend Ceasefire and Resume Nuclear Talks
United Nations Security Council Deeply Divided Over United States Capture of Venezuelan President
US and Iran Exchange Direct Military Strikes Amid Fragile Gulf Ceasefire
World Health Organization Warns of Catastrophic Ebola Outbreak in DR Congo
Russia Threatens New Wave of Strikes on Ukrainian Infrastructure and Embassies
Scientists Warn Atlantic Ocean Currents Could Collapse Faster Than Projected
Anthropic Reaches $900 Billion Valuation in Historic AI Funding Round
Washington Imposes Crippling Sanctions on Iranian Maritime Authority
Japan and the Philippines Initiate Strategic Intelligence-Sharing Pact
Microsoft Deploys Autonomous Computer-Using AI Agents to Global Markets
Anthropic Secures $45 Billion Compute Infrastructure Agreement With SpaceX
U.S. Director of National Intelligence Resigns Amid Administration Shakeup
Micron Technology Crosses Trillion-Dollar Valuation Amid Unprecedented Hardware Demand
Canada and Germany Finalize Historic Long-Term LNG Export Agreement
China Expands International Travel Restrictions on Domestic AI Researchers
Japan Approves Sweeping Overhaul of National Intelligence Apparatus
Global Airlines Scramble Logistics as Middle East Airspace Remains Fractured
Japan's Naphtha Imports Plunge 47 Percent Amid Strait of Hormuz Closure
Global Crude Prices Retreat Below $96 as Gulf Tensions Momentarily Ease
Generative AI Outperforms Human Baselines in Landmark Global Creativity Study
NASA Partners With Private Aerospace to Unveil Permanent Lunar Base Architecture
South Korean Equity Markets Surge on Next-Generation Memory Chip Frenzy
×