A selection of the most important recent news, articles, and papers about AI.
News, Articles, and Analyses
(Tuesday, July 02, 2024) “IBM’s generative AI solution takes a top spot on the BIRD benchmark for handling complex database queries”
(Monday, July 15, 2024) “IBM joined representatives from many of the world’s major religions in Japan to discuss ethical AI development.”
Author: Dean Takahashi
(Monday, July 15, 2024) “Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Advanced Micro Devices executives revealed the details of the chipmaker’s latest AI PC architecture, which includes a new neural processing unit (NPU) in the company’s latest AMD Ryzen AI chips. The company announced the latest AMD Ryzen […]”
(Tuesday, July 16, 2024) “As a tribute to Archimedes, whose 2311th anniversary we’re celebrating this year, we are proud to release our first Mathstral model, a specific 7B model designed for math reasoning and scientific discovery. The model has a 32k context window published under the Apache 2.0 license.”
“In a struggling games industry AI has been hailed as a possible saviour. But not everyone’s convinced.”
Technical Papers and Preprints
Author: Carvão, Paulo
(Thursday, May 23, 2024) “This article addresses the societal costs associated with the lack of regulation in Artificial Intelligence and proposes a framework combining innovation and regulation. Over fifty years of AI research, catalyzed by declining computing costs and the proliferation of data, have propelled AI into the mainstream, promising significant economic benefits. Yet, this rapid adoption underscores risks, from bias amplification and labor disruptions to existential threats posed by autonomous systems. The discourse is polarized between accelerationists, advocating for unfettered technological advancement, and doomers, calling for a slowdown to prevent dystopian outcomes. This piece advocates for a middle path that leverages technical innovation and smart regulation to maximize the benefits of AI while minimizing its risks, offering a pragmatic approach to the responsible progress of AI technology. Technical invention beyond the most capable foundation models is needed to contain catastrophic risks. Regulation is required to create incentives for this research while addressing current issues.”
Authors: Brahman, Faeze; Kumar, Sachin; Balachandran, Vidhisha; Dasigi, Pradeep; Pyatkin, Valentina; Ravichander, Abhilasha; Wiegreffe, Sarah; Dziri, Nouha; Chandu, Khyathi; Hessel, Jack; Tsvetkov, Yulia; Smith, Noah A.; Choi, Yejin; Hajishirzi, Hannaneh
(Tuesday, July 02, 2024) “Chat-based language models are designed to be helpful, yet they should not comply with every user request. While most existing work primarily focuses on refusal of “unsafe” queries, we posit that the scope of noncompliance should be broadened. We introduce a comprehensive taxonomy of contextual noncompliance describing when and how models should not comply with user requests. Our taxonomy spans a wide range of categories including incomplete, unsupported, indeterminate, and humanizing requests (in addition to unsafe requests). To test noncompliance capabilities of language models, we use this taxonomy to develop a new evaluation suite of 1000 noncompliance prompts. We find that most existing models show significantly high compliance rates in certain previously understudied categories with models like GPT-4 incorrectly complying with as many as 30% of requests. To address these gaps, we explore different training strategies using a synthetically-generated training set of requests and expected noncompliant responses. Our experiments demonstrate that while direct finetuning of instruction-tuned models can lead to both over-refusal and a decline in general capabilities, using parameter efficient methods like low rank adapters helps to strike a good balance between appropriate noncompliance and other capabilities.”
Share this:
A selection of the most important recent news, articles, and papers about AI.
News, Articles, and Analyses
(Friday, July 12, 2024) “Yes, gen AI can be dazzling. But to deliver value, leaders will have to look beyond center stage.”
“Informing product leads and their teams of innovators, designers, and developers as they work toward safety, security, and trust while creating AI products and services for use in education.”
Author: Steven Dickens
“Chief Technology Advisor Steven Dickens shares insights on how IBM uses AI to enhance sports, democratizing innovation through open-source.”
Technical Papers and Preprints
Authors: Ravi, Selvan Sunitha; Mielczarek, Bartosz; Kannappan, Anand; Kiela, Douwe; Qian, Rebecca
(Thursday, July 11, 2024) “Retrieval Augmented Generation (RAG) techniques aim to mitigate hallucinations in Large Language Models (LLMs). However, LLMs can still produce information that is unsupported or contradictory to the retrieved contexts. We introduce LYNX, a SOTA hallucination detection LLM that is capable of advanced reasoning on challenging real-world hallucination scenarios. To evaluate LYNX, we present HaluBench, a comprehensive hallucination evaluation benchmark, consisting of 15k samples sourced from various real-world domains. Our experiment results show that LYNX outperforms GPT-4o, Claude-3-Sonnet, and closed and open-source LLM-as-a-judge models on HaluBench. We release LYNX, HaluBench and our evaluation code for public access.”
Authors: Woisetschläger, Herbert; Mertel, Simon; Krönke, Christoph; Mayer, Ruben; Jacobsen, Hans-Arno
(Thursday, July 11, 2024) “The European Union Artificial Intelligence Act mandates clear stakeholder responsibilities in developing and deploying machine learning applications to avoid substantial fines, prioritizing private and secure data processing with data remaining at its origin. Federated Learning (FL) enables the training of generative AI Models across data siloes, sharing only model parameters while improving data security. Since FL is a cooperative learning paradigm, clients and servers naturally share legal responsibility in the FL pipeline. Our work contributes to clarifying the roles of both parties, explains strategies for shifting responsibilities to the server operator, and points out open technical challenges that we must solve to improve FL’s practical applicability under the EU AI Act.”
Share this:
A selection of the most important recent news and articles about AI.
(Sunday, May 12, 2024) “Building a useful quantum computer in practice is incredibly challenging. Significant improvements are needed in the scale, fidelity, speed, reliability, and programmability of quantum computers to…”
(Sunday, July 07, 2024) “From ‘delves’ to ‘showcasing,’ certain words boomed in usage after LLMs became mainstream.”
(Monday, July 08, 2024) “We believe Multi-Agent Systems are the only viable approach to bringing generative AI into the U.S. government in a managed manner.”
Share this:
(Tuesday, June 27, 2023) “A new McKinsey study shows that software developers can complete tasks up to twice as fast with generative AI. Four actions can help maximize productivity.”
(Tuesday, July 02, 2024) “IBM has made available IBM Concert, leveraging generative artificial intelligence and knowledge graphs to surface in real-time dependencies.”
(Wednesday, July 03, 2024) “It’s not that AI-generated code introduces new security gaps; it just means that even more code will make its way through existing gaps.”
(Friday, July 05, 2024) “The Foundational Model Transparency Index illuminates the black box of data on which large language models are trained.”
(Friday, July 05, 2024) “Nintendo has commented on the controversial topic of generative AI in video game development, outline the pros and cons as it sees them.”
“Steven Webb, chief technology & innovation officer, Capgemini UK argues for enterprise organisations to put aside GenAI experimentation and build long-term strategies with it.”
“Freeplay CEO Ian Cairns describes how the organization has adapted to the paradigm shift that generative AI demands while building AI applications”
“BOSTON and PALO ALTO, Calif., July 8, 2024 — Zapata Computing Holdings Inc., a leader in Industrial Generative AI software solutions, and D-Wave Quantum Inc., a leader in quantum computing […]”
Share this:
- Threads
-
Categories AI Tags AI, Capgemini, D-Wave, Deloitte, games, Generative AI, IBM, McKinsey, quantum computing, video, Zapata
(Wednesday, June 19, 2024) “Like it or hate it, artificial intelligence — especially generative AI — is the technology story of 2024. OpenAI, with its rollouts of viral services like”
(Wednesday, June 26, 2024) “How will OpenAI keep its promise to media companies?”
(Thursday, June 27, 2024) “Illia Polosukhin is one of the “Transformer 8,” a group that many call the founding fathers of generative AI. They co-wrote a paper at Google in 2017 that es…”
“AI-powered NPCs that don’t need a script could make games—and other worlds—deeply immersive.”
“Designing and Building AI Solutions is a new online certificate program, with one-of-a-kind features designed to enhance the learning experience for those that desire to build their own AI products—no coding required.”