Notable and Interesting Recent AI News, Articles, and Papers for Tuesday, July 23, 2024

A selection of the most important recent news, articles, and papers about AI.



News, Articles, and Analyses

OpenAI Slashes the Cost of Using Its AI With a ‘Mini’ Model | WIRED

(Thursday, July 18, 2024) “With competing models—including many free ones—flooding the market, OpenAI is announcing a cheaper way to use its AI.”

AI in Context: Cloudera Accelerates AI ROI with Verta Acquisition – The Futurum Group

Author: Dr. Bob Sutor

“Learn why Cloudera’s acquisition of Verta was a smart move to extend its AI capabilities and accelerate customer AI implementation ROI.”

Technical Papers and Preprints

[2407.15160] When Can Transformers Count to n?

Authors: Yehudai, Gilad; Kaplan, Haim; Ghandeharioun, Asma; Geva, Mor; Globerson, Amir

(Sunday, July 21, 2024) “Large language models based on the transformer architectures can solve highly complex tasks. But are there simple tasks that such models cannot solve? Here we focus on very simple counting tasks, that involve counting how many times a token in the vocabulary have appeared in a string. We show that if the dimension of the transformer state is linear in the context length, this task can be solved. However, the solution we propose does not scale beyond this limit, and we provide theoretical arguments for why it is likely impossible for a size limited transformer to implement this task. Our empirical results demonstrate the same phase-transition in performance, as anticipated by the theoretical argument. Our results demonstrate the importance of understanding how transformers can solve simple tasks.”

[2407.15671] Problems in AI, their roots in philosophy, and implications for science and society

Authors: Velthoven, Max; Marcus, Eric

(Monday, July 22, 2024) “Artificial Intelligence (AI) is one of today’s most relevant emergent technologies. In view thereof, this paper proposes that more attention should be paid to the philosophical aspects of AI technology and its use. It is argued that this deficit is generally combined with philosophical misconceptions about the growth of knowledge. To identify these misconceptions, reference is made to the ideas of the philosopher of science Karl Popper and the physicist David Deutsch. The works of both thinkers aim against mistaken theories of knowledge, such as inductivism, empiricism, and instrumentalism. This paper shows that these theories bear similarities to how current AI technology operates. It also shows that these theories are very much alive in the (public) discourse on AI, often called Bayesianism. In line with Popper and Deutsch, it is proposed that all these theories are based on mistaken philosophies of knowledge. This includes an analysis of the implications of these mistaken philosophies for the use of AI in science and society, including some of the likely problem situations that will arise. This paper finally provides a realistic outlook on Artificial General Intelligence (AGI) and three propositions on A(G)I and philosophy (i.e., epistemology).”

[2407.15847] LLMmap: Fingerprinting For Large Language Models

Authors: Pasquini, Dario; Kornaropoulos, Evgenios M.; Ateniese, Giuseppe

(Monday, July 22, 2024) “We introduce LLMmap, a first-generation fingerprinting attack targeted at LLM-integrated applications. LLMmap employs an active fingerprinting approach, sending carefully crafted queries to the application and analyzing the responses to identify the specific LLM model in use. With as few as 8 interactions, LLMmap can accurately identify LLMs with over 95% accuracy. More importantly, LLMmap is designed to be robust across different application layers, allowing it to identify LLMs operating under various system prompts, stochastic sampling hyperparameters, and even complex generation frameworks such as RAG or Chain-of-Thought.”

Notable and Interesting Recent AI News, Articles, and Papers for Thursday, July 18, 2024

A selection of the most important recent news, articles, and papers about AI.



News, Articles, and Analyses

IBM text-to-SQL generator tops leaderboard – IBM Research

(Tuesday, July 02, 2024) “IBM’s generative AI solution takes a top spot on the BIRD benchmark for handling complex database queries”

Reaffirming IBM’s commitment to the Rome Call for AI ethics – IBM Research

(Monday, July 15, 2024) “IBM joined representatives from many of the world’s major religions in Japan to discuss ethical AI development.”

AMD takes a deep dive into architecture for the AI PC chips | VentureBeat

Author: Dean Takahashi

(Monday, July 15, 2024) “Advanced Micro Devices executives revealed the details of the chipmaker’s latest AI PC architecture, which includes a new neural processing unit (NPU) in the company’s latest AMD Ryzen AI chips. The company announced the latest AMD Ryzen […]”

MathΣtral | Mistral AI | Frontier AI in your hands

(Tuesday, July 16, 2024) “As a tribute to Archimedes, whose 2311th anniversary we’re celebrating this year, we are proud to release our first Mathstral model, a specific 7B model designed for math reasoning and scientific discovery. The model has a 32k context window published under the Apache 2.0 license.”

AI in gaming: Developers worried by generative tech

“In a struggling games industry AI has been hailed as a possible saviour. But not everyone’s convinced.”

Technical Papers and Preprints

[2407.12690] The Dual Imperative: Innovation and Regulation in the AI Era

Author: Carvão, Paulo

(Thursday, May 23, 2024) “This article addresses the societal costs associated with the lack of regulation in Artificial Intelligence and proposes a framework combining innovation and regulation. Over fifty years of AI research, catalyzed by declining computing costs and the proliferation of data, have propelled AI into the mainstream, promising significant economic benefits. Yet, this rapid adoption underscores risks, from bias amplification and labor disruptions to existential threats posed by autonomous systems. The discourse is polarized between accelerationists, advocating for unfettered technological advancement, and doomers, calling for a slowdown to prevent dystopian outcomes. This piece advocates for a middle path that leverages technical innovation and smart regulation to maximize the benefits of AI while minimizing its risks, offering a pragmatic approach to the responsible progress of AI technology. Technical invention beyond the most capable foundation models is needed to contain catastrophic risks. Regulation is required to create incentives for this research while addressing current issues.”

[2407.12043] The Art of Saying No: Contextual Noncompliance in Language Models

Authors: Brahman, Faeze; Kumar, Sachin; Balachandran, Vidhisha; Dasigi, Pradeep; Pyatkin, Valentina; Ravichander, Abhilasha; Wiegreffe, Sarah; Dziri, Nouha; Chandu, Khyathi; Hessel, Jack; Tsvetkov, Yulia; Smith, Noah A.; Choi, Yejin; Hajishirzi, Hannaneh

(Tuesday, July 02, 2024) “Chat-based language models are designed to be helpful, yet they should not comply with every user request. While most existing work primarily focuses on refusal of “unsafe” queries, we posit that the scope of noncompliance should be broadened. We introduce a comprehensive taxonomy of contextual noncompliance describing when and how models should not comply with user requests. Our taxonomy spans a wide range of categories including incomplete, unsupported, indeterminate, and humanizing requests (in addition to unsafe requests). To test noncompliance capabilities of language models, we use this taxonomy to develop a new evaluation suite of 1000 noncompliance prompts. We find that most existing models show significantly high compliance rates in certain previously understudied categories with models like GPT-4 incorrectly complying with as many as 30% of requests. To address these gaps, we explore different training strategies using a synthetically-generated training set of requests and expected noncompliant responses. Our experiments demonstrate that while direct finetuning of instruction-tuned models can lead to both over-refusal and a decline in general capabilities, using parameter efficient methods like low rank adapters helps to strike a good balance between appropriate noncompliance and other capabilities.”

Notable and Interesting Recent AI News, Articles, and Papers for Monday, July 15, 2024

A selection of the most important recent news, articles, and papers about AI.



News, Articles, and Analyses

Developers get by with a little help from AI: Stack Overflow Knows code assistant pulse survey results – Stack Overflow

Gen AI and beyond: Where else to focus now | McKinsey

(Friday, July 12, 2024) “Yes, gen AI can be dazzling. But to deliver value, leaders will have to look beyond center stage.”

Designing for Education with Artificial Intelligence: An Essential Guide for Developers – Office of Educational Technology

“Informing product leads and their teams of innovators, designers, and developers as they work toward safety, security, and trust while creating AI products and services for use in education.”

IBM’s AI, Open-Source Granite Models & Sports Technology – The Futurum Group

Author: Steven Dickens

“Chief Technology Advisor Steven Dickens shares insights on how IBM uses AI to enhance sports, democratizing innovation through open-source.”

Technical Papers and Preprints

[2407.08488] Lynx: An Open Source Hallucination Evaluation Model

Authors: Ravi, Selvan Sunitha; Mielczarek, Bartosz; Kannappan, Anand; Kiela, Douwe; Qian, Rebecca

(Thursday, July 11, 2024) “Retrieval Augmented Generation (RAG) techniques aim to mitigate hallucinations in Large Language Models (LLMs). However, LLMs can still produce information that is unsupported or contradictory to the retrieved contexts. We introduce LYNX, a SOTA hallucination detection LLM that is capable of advanced reasoning on challenging real-world hallucination scenarios. To evaluate LYNX, we present HaluBench, a comprehensive hallucination evaluation benchmark, consisting of 15k samples sourced from various real-world domains. Our experiment results show that LYNX outperforms GPT-4o, Claude-3-Sonnet, and closed and open-source LLM-as-a-judge models on HaluBench. We release LYNX, HaluBench and our evaluation code for public access.”

[2407.08105] Federated Learning and AI Regulation in the European Union: Who is Responsible? — An Interdisciplinary Analysis

Authors: Woisetschläger, Herbert; Mertel, Simon; Krönke, Christoph; Mayer, Ruben; Jacobsen, Hans-Arno

(Thursday, July 11, 2024) “The European Union Artificial Intelligence Act mandates clear stakeholder responsibilities in developing and deploying machine learning applications to avoid substantial fines, prioritizing private and secure data processing with data remaining at its origin. Federated Learning (FL) enables the training of generative AI Models across data siloes, sharing only model parameters while improving data security. Since FL is a cooperative learning paradigm, clients and servers naturally share legal responsibility in the FL pipeline. Our work contributes to clarifying the roles of both parties, explains strategies for shifting responsibilities to the server operator, and points out open technical challenges that we must solve to improve FL’s practical applicability under the EU AI Act.”

Notable and Interesting Recent AI News, Articles, and Papers for Thursday, July 11, 2024

A selection of the most important recent news and articles about AI.


Enabling Quantum Computing with AI | NVIDIA Technical Blog

(Sunday, May 12, 2024) “Building a useful quantum computer in practice is incredibly challenging. Significant improvements are needed in the scale, fidelity, speed, reliability, and programmability of quantum computers to…”

The Words That Give Away Generative AI Text | WIRED

(Sunday, July 07, 2024) “From ‘delves’ to ‘showcasing,’ certain words boomed in usage after LLMs became mainstream.”

Top 5 potential uses, pitfalls for generative AI in federal government

(Monday, July 08, 2024) “We believe Multi-Agent Systems are the only viable approach to bringing generative AI into the U.S. government in a managed manner.”

Notable and Interesting Recent AI News, Articles, and Papers for Tuesday, July 9, 2024


Unleash developer productivity with generative AI | McKinsey

(Tuesday, June 27, 2023) “A new McKinsey study shows that software developers can complete tasks up to twice as fast with generative AI. Four actions can help maximize productivity.”

IBM Makes Generative AI Platform for DevOps Available – DevOps.com

(Tuesday, July 02, 2024) “IBM has made available IBM Concert, leveraging generative artificial intelligence and knowledge graphs to surface in real-time dependencies.”

Maintaining human oversight in AI-enhanced software development – Help Net Security

(Wednesday, July 03, 2024) “It’s not that AI-generated code introduces new security gaps; it just means that even more code will make its way through existing gaps.”

Transparency From Behind the Generative AI Curtain – The New Stack

(Friday, July 05, 2024) “The Foundational Model Transparency Index illuminates the black box of data on which large language models are trained.”

Nintendo Says Generative AI Can Be Used in ‘Creative Ways,’ but Highlights IP Issues – IGN

(Friday, July 05, 2024) “Nintendo has commented on the controversial topic of generative AI in video game development, outlining the pros and cons as it sees them.”

Enterprises must stop GenAI experiments and start long-term strategies | Computer Weekly

“Steven Webb, chief technology & innovation officer, Capgemini UK argues for enterprise organisations to put aside GenAI experimentation and build long-term strategies with it.”

Gen AI and software development | Deloitte Insights

“Freeplay CEO Ian Cairns describes how the organization has adapted to the paradigm shift that generative AI demands while building AI applications”

Zapata AI and D-Wave Quantum Announce Expanded Partnership for Advanced Generative AI Solutions

“BOSTON and PALO ALTO, Calif., July 8, 2024 — Zapata Computing Holdings Inc., a leader in Industrial Generative AI software solutions, and D-Wave Quantum Inc., a leader in quantum computing […]”

Notable and Interesting Recent AI News, Articles, and Papers for Monday, July 1, 2024


France leads the pack for generative AI funding in Europe | TechCrunch

(Wednesday, June 19, 2024) “Like it or hate it, artificial intelligence — especially generative AI — is the technology story of 2024. OpenAI, with its rollouts of viral services like”

Generative AI Can’t Cite Its Sources

(Wednesday, June 26, 2024) “How will OpenAI keep its promise to media companies?”

Illia Polosukhin On Inventing The Tech Behind Generative AI At Google

(Thursday, June 27, 2024) “Illia Polosukhin is one of the “Transformer 8,” a group that many call the founding fathers of generative AI. They co-wrote a paper at Google in 2017 that es…”

How generative AI could reinvent what it means to play

“AI-powered NPCs that don’t need a script could make games—and other worlds—deeply immersive.”

Cornell transforms generative AI education and clones a faculty member | Cornell Chronicle

“Designing and Building AI Solutions is a new online certificate program, with one-of-a-kind features designed to enhance the learning experience for those that desire to build their own AI products—no coding required.”
