Budapest Post

Cum Deo pro Patria et Libertate
Budapest, Europe and world news

Google’s SummAE AI generates abstract summaries of paragraphs

Google’s SummAE AI generates abstract summaries of paragraphs

Google researchers propose a novel AI summarization model - SummAE- capable of generating abstract summaries of paragraphs.
Machines have a tougher time summarizing text than you’d think, at least where the summarization is abstractive rather than extractive. While the extraction requires merely concatenating sentences, abstraction involves the task of paraphrasing using novel sentences. Progress has been made in the news domain recently, perhaps owing to the abundance of corpora on which algorithmic systems can be trained. But robust summarization of most other writing forms remains an unsolved problem.

Motivated by this, a team at Google Brain investigated an abstractive summarization system dubbed SummAE that’s largely unsupervised, meaning it’s able to generalize from a small amount of training data to unseen textual examples. While it couldn’t summarize beyond single five-sentence paragraphs, the researchers claim it “significantly” improves upon the baseline and represents a “major” step in the direction of human-level performance.


Machines have a tougher time summarizing text than you’d think, at least where the summarization is abstractive rather than extractive. While the extraction requires merely concatenating sentences, abstraction involves the task of paraphrasing using novel sentences. Progress has been made in the news domain recently, perhaps owing to the abundance of corpora on which algorithmic systems can be trained. But robust summarization of most other writing forms remains an unsolved problem.

Motivated by this, a team at Google Brain investigated an abstractive summarization system dubbed SummAE that’s largely unsupervised, meaning it’s able to generalize from a small amount of training data to unseen textual examples. While it couldn’t summarize beyond single five-sentence paragraphs, the researchers claim it “significantly” improves upon the baseline and represents a “major” step in the direction of human-level performance.

Recommended videosPowered by AnyClip
Go Eat A McRib
Play

Unmute
Duration
0:59
/
Current Time
0:17

Fullscreen
Up Next

NOW PLAYINGGo Eat A McRib
Scientists Discover What Makes 'Water Bears' Virtually Indestructible
Doctor diagnoses his own cancer with an app
There's A Bigger Danger To Pedestrians Than Walking While Distracted
Prince Harry to edit National Geographic's Instagram
The Secret Culprit Of America's Student Debt Crisis
5 Quotes About The Power of Books

The data set and code are freely available on GitHub, along with the configuration settings for the best model.

“As one of the very first works approaching single-document [abstract summarization], we propose a novel neural model — SummAE,” wrote the coauthors. “[We believe it] is therefore desirable to have models capable of automatically summarizing documents abstractively with little to no supervision.”

SummAE contains a denoising autoencoder that encodes (that is, generates numerical representations of) sentences and paragraphs of the target text in a shared space. Guided by a decoder whose input is prepended with a token signaling whether to decode a sentence or a paragraph, the system generates summaries by decoding each sentence from the encoded paragraphs.

The researchers discovered that most traditional approaches to training the auto-encoder resulted in long, multi-sentence summaries. To encourage it to learn higher-level concepts disentangled from their original expression, the team employed two denoising approaches — randomly masking tokens and permuting the order of sentences within paragraphs — that increased the number of training examples substantially. They also experimented with an adversarial critic component that could distinguish between sentences and paragraphs, in addition to two pretraining tasks that encouraged the encoder to learn how sentences narratively followed within a paragraph.

The researchers trained three different variations of SummAE on the ROCStories, a corpus of self-contained, diverse, non-technical, and concise prose. They split the original 98,159 training stories into three separate collections — a training set, a validation set, and a test set — and collected three human summaries each for 500 validation examples and 500 test examples.

After 100,000 training steps with pretraining, the team reports that the best model significantly outperformed a baseline extractive sentence generator on the Recall-Oriented Understudy for Gisting Evaluation (ROUGE), a set of metrics devised to evaluate automatic summarization. Moreover, they say that in a qualitative study involving evaluators recruited through Amazon’s Mechanical Turk, volunteers rated one of the three SummAE models’ summaries “fluent” and “information-relevant” 80% of the time.

“The paragraph reconstructions show some coherence, although with some disfluencies and factual inaccuracies that are common with neural generative models,” wrote the coauthors. “Since the summaries are decoded from the same latent vector as the reconstructions, improving them could lead to more accurate summaries.”
AI Disclaimer: An advanced artificial intelligence (AI) system generated the content of this page on its own. This innovative technology conducts extensive research from a variety of reliable sources, performs rigorous fact-checking and verification, cleans up and balances biased or manipulated content, and presents a minimal factual summary that is just enough yet essential for you to function as an informed and educated citizen. Please keep in mind, however, that this system is an evolving technology, and as a result, the article may contain accidental inaccuracies or errors. We urge you to help us improve our site by reporting any inaccuracies you find using the "Contact Us" link at the bottom of this page. Your helpful feedback helps us improve our system and deliver more precise content. When you find an article of interest here, please look for the full and extensive coverage of this topic in traditional news sources, as they are written by professional journalists that we try to support, not replace. We appreciate your understanding and assistance.
Newsletter

Related Articles

0:00
0:00
Close
U.S. and Hungarian Officials Talk About Economic Collaboration and Sanctions Strategy
Technology Giants Activate Lobbying Campaigns Against Strict EU Regulations
Pope Francis Admitted to Hospital in Rome Amid Increasing Speculation on Succession
Zelensky Calls on World Leaders to Back Peace as Tensions Rise with Trump
UK Leader Keir Starmer Calls for US Security Guarantee in Ukraine Peace Deal
NATO Chief Urges Higher Defense Expenditure in Europe
The negotiation teams of Trump and Putin meet directly, establishing the groundwork for a significant advancement.
Rubio Touches Down in Riyadh Before Key U.S.-Russia Discussions
Students in Serbian universities Unite to Hold Coordinated Protests for Accountability.
US State Department Removes Taiwan Independence Statement from Website
Abolishing opposition won't protect Germany from Nazism—this is precisely what led Germany to become Nazi!
Transatlantic Gold Rush: Traders Shift Bullion in Response to Tariff Anxieties and Market Instability
Bill Ackman Backs Uber as the Company Shifts Towards Profitability
AI Titans Challenge Nvidia's Supremacy in Light of New Chip Innovations
US and Russian Officials to Meet in Saudi Arabia Over Ending Ukraine Conflict. Ukraine and European leaders – who profit from this war – excluded from the negotiations.
Macron Calls for Urgent Summit as Ukraine Conflict Business Model is Threatened
Trump’s Defense Secretary: Ukraine Won’t Join NATO or Regain Lost Territories
Zelensky Urges Europe to Bolster Its Military in Light of Uncertain US Backing
Chinese Zoo Confesses to Dyeing Donkeys to Look Like Zebras
Elon Musk is Sherlock Holmes - Movie Trailer Parody featuring Donald Trump's Detective
Trump's Greenland Suggestion Sparks Sovereignty Discussions Amid Historical Grievances
OpenAI Board Dismisses Elon Musk's Offer to Acquire the Company.
USAID Uncovered: American Taxpayer Funds Leveraged to Erode Democracy in Europe Until Trump Put a Stop to It.
JD Vance and Scholz Did Not Come Together at the Munich Security Conference.
EU Official Participates in Discussions in Washington Amid Trade Strains
Qatar Contemplates Reducing French Investments Due to PSG Chief Investigation
Germany's Green Agenda Encounters Ambiguity Before Elections
Trump Did Not Notify Germany's Scholz About His Ukraine Peace Proposal.
Munich Car Attack Escalates Migration Discourse Before German Elections
NATO Allies Split on Trump's Proposal for 5% Defense Spending Increase
European Parliament Advocates for Encrypted Messaging to Ensure Secure Communications
Trump's Defense Spending Goal Creates Division Among NATO Partners
French Prime Minister Bayrou Navigates a Challenging Path Amid Budget Preservation and Immigration Discourse
Steering Through the Updated Hierarchy at the European Commission
Parliamentarian Calls for Preservation of AI Liability Directive
Mark Rutte Calls on NATO Allies to Increase Defence Expenditures
Dresden Marks the 80th Anniversary of the World War II Bombing.
Global Community Pledges to Aid Syria's Political Transition
EU Allocates €200 Billion for AI Investments, Introduces €20 Billion Fund for Gigafactories
EU Recognizes Its Inability to Close the USAID Funding Shortfall Due to Stalled US Aid
Commission President von der Leyen Missing from Notre Dame Reopening Due to Last-Minute Cancellation
EU Officializes Disinformation Code for Online Platforms, Omitting X
EU Fails to Fully Implement Key Cybersecurity Directives
EU Under Fire for Simplification Discussions Regarding Corporate Sustainability Reporting
Shein Encountering Further Information Request from the EU During Ongoing Investigation
European Commission Initiates Investigation into Shein as It Aims at Chinese E-Commerce Regulations
German Officials Respond to U.S. Proposal for Peace Talks with Russia
Senate Approves Robert F. Kennedy Jr. as Secretary of Health and Human Services.
Trump and Putin Engage in Discussions on Ukraine Peace Negotiations Amid Worldwide Responses
Honda and Nissan End Merger Talks
×