Global AI Weekly

Issue number: 61 | Tuesday, July 16, 2024

Highlights

if(article.Image != null) { Can the climate survive AI’s thirst for energy? – podcast }

Can the climate survive AI’s thirst for energy? – podcast

Artificial intelligence companies have lofty ambitions for what the technology could achieve, from curing diseases to eliminating poverty. But the energy required to power these innovations is threatening critical environmental targets. Madeleine Finlay hears from the Guardian’s energy correspondent, Jillian Ambrose, and UK technology editor, Alex Hern, to find out how big AI’s energy problem is, and whether it can be solved before it is too late.

theguardian.com

if(article.Image != null) { RUBICON: Evaluating conversations between humans and AI systems }

RUBICON: Evaluating conversations between humans and AI systems

RUBICON evaluates AI-driven conversations and improves their quality by learning detailed domain-specific rubrics from minimal data. It gathers insights on AI assistant performance while maintaining user privacy and data security.

microsoft.com

Research

if(article.Image != null) {  Better relation and entity extraction with LLMs }

Better relation and entity extraction with LLMs

We've talked about building knowledge graphs before. It's still a very new topic, and a lot of new research is coming out every week. In this week's paper, for example, we explore how to get better results extracting relationships from documents by mixing large language models and traditional entity and relationship recognition methods. Again, one of those papers is niche but very useful.

arxiv.org

Video

if(article.Image != null) { Copilot L33t Sp34k - RAI and UX principals in Copilot for Security }

Copilot L33t Sp34k - RAI and UX principals in Copilot for Security

Sarah chats to Mark Kendrick and Harmony Mabry from the Copilot for Security product team about how Microsoft's Responsible AI (RAI) principles are realized in the Copilot for Security product to protect against AI harms such as overreliance.

youtube.com

if(article.Image != null) { Imitation Intelligence, my keynote for PyCon US 2024 }

Imitation Intelligence, my keynote for PyCon US 2024

I gave an invited keynote at PyCon US 2024 in Pittsburgh this year. My goal was to say some interesting things about AI - specifically about Large Language Models - both to help catch people up who may not have been paying close attention, but also to give people who were paying close attention some new things to think about.

simonwillison.net

Articles

if(article.Image != null) { Unified Database: Laying the foundation for large language model vertical applications }

Unified Database: Laying the foundation for large language model vertical applications

Unified databases offer better knowledge transfer between multimodal data types. They provide substantial corpus support for large language models and are poised to drive innovation in underlying hardware, laying the foundation for data-enhanced AI.

microsoft.com

if(article.Image != null) { Diffusion Model from Scratch in Pytorch }

Diffusion Model from Scratch in Pytorch

In our implementation of the model, we will start by defining our imports (possible pip install commands commented for reference) and coding our sinusoidal time step embeddings. The authors of the DDPM paper used the UNET architecture originally designed for medical image segmentation to build a model to predict the noise for the diffusion reverse process.

towardsdatascience.com

if(article.Image != null) { Hacking “Codenames” with GloVe Embeddings }

Hacking “Codenames” with GloVe Embeddings

In conclusion, this greedy GloVe-based algorithm performs well as both the spymaster and operative in the Codenames game, by offering an effective way to encode and decode words via a clue and number.

towardsdatascience.com

if(article.Image != null) { Big Opportunities in Small Data }

Big Opportunities in Small Data

I gave an invited keynote at Citus Con 2023, the PostgreSQL conference. Below is the abstract, video and slides from the presentation.

simonwillison.net

if(article.Image != null) { FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision }

FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision

Attention, as a core layer of the ubiquitous Transformer architecture, is a bottleneck for large language models and long-context applications. FlashAttention (and FlashAttention-2) pioneered an approach to speed up attention on GPUs by minimizing memory reads/writes, and is now used by most libraries to accelerate Transformer training and inference. This has contributed to a massive increase in LLM context length in the last two years, from 2-4K (GPT-3, OPT) to 128K (GPT-4), or even 1M (Llama 3).

pytorch.org

if(article.Image != null) { Haisor: Human-aware Indoor Scene Optimization via Deep Reinforcement Learning }

Haisor: Human-aware Indoor Scene Optimization via Deep Reinforcement Learning

3D scene synthesis facilitates and benefits many real-world applications. Most scene generators focus on making indoor scenes plausible via learning from training data and leveraging extra constraints such as adjacency and symmetry.

research.nvidia.com

if(article.Image != null) { Study reveals why AI models that analyze medical images can be biased }

Study reveals why AI models that analyze medical images can be biased

These models, which can predict a patient’s race, gender, and age, seem to use those traits as shortcuts when making medical diagnoses.

news.mit.edu

if(article.Image != null) { Meet “Rufus”: Amazon’s New AI Chatbot Revolutionizing E-commerce }

Meet “Rufus”: Amazon’s New AI Chatbot Revolutionizing E-commerce

In the world of e-commerce, Amazon has always been at the forefront of innovation. This year, they introduced a game-changer: Rufus, an…

medium.com

if(article.Image != null) { Reasoning skills of large language models are often overestimated }

Reasoning skills of large language models are often overestimated

New CSAIL research highlights how LLMs excel in familiar scenarios but struggle in novel ones, questioning their true reasoning abilities versus reliance on memorization.

news.mit.edu

if(article.Image != null) { OpenAI Whistleblowers Urge SEC Investigation into Non-Disclosure Agreements }

OpenAI Whistleblowers Urge SEC Investigation into Non-Disclosure Agreements

In a significant move, whistleblowers from OpenAI have approached the Securities and Exchange Commission (SEC) to investigate the…

medium.com

if(article.Image != null) { ChatGPT and Cybersecurity: Navigating the Double-Edged Sword }

ChatGPT and Cybersecurity: Navigating the Double-Edged Sword

Discover how ChatGPT is transforming cybersecurity, presenting both opportunities and risks. Learn about AI-generated phishing scams…

medium.com

Code

if(article.Image != null) { PyEnv & Poetry Tutorial: Ultimate Data Science Setup }

PyEnv & Poetry Tutorial: Ultimate Data Science Setup

How to use PyEnv and Poetry together for your environment and package management for data science projects.

towardsdatascience.com

>