Language Modelling - Search News

Open source Mamba 3 arrives to surpass Transformer architecture with nearly 4% improved language modeling, reduced latency

This release is good for developers building long-context applications, real-time reasoning agents, or those seeking to reduce GPU costs in high-volume production environments.

The Economist

Top AI models underperform in languages other than English

This illustrates a widespread problem affecting large language models (LLMs): even when an English-language version passes a safety test, it can still hallucinate dangerous misinformation in other ...

Tech Xplore on MSN

A better method for identifying overconfident large language models

Large language models (LLMs) can generate credible but inaccurate responses, so researchers have developed uncertainty quantification methods to check the reliability of predictions. One popular ...

2monon MSN

Some elite AI researchers say language is limiting. Here's the new kind of model they are building instead.

Top AI researchers like Fei-Fei Li and Yann LeCun are developing world models, which don't rely solely on language.

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

OpenAI, Mistral AI release new hardware-efficient language models

OpenAI Group PBC and Mistral AI SAS today introduced new artificial intelligence models optimized for cost-sensitive use cases. OpenAI is rolling out two algorithms called GPT-5.4 mini and GPT 5.4 ...

TechCrunch

The emerging types of language models and why they matter

AI systems that understand and generate text, known as language models, are the hot new thing in the enterprise. A recent survey found that 60% of tech leaders said that their budgets for AI language ...

Communications of the ACM

Measuring What Matters in Large Language Model Performance

As large language models (LLMs) gain momentum worldwide, there’s a growing need for reliable ways to measure their performance. Benchmarks that evaluate LLM outputs allow developers to track ...

10mon

Exclusive: New Claude Model Triggers Stricter Safeguards at Anthropic

Anthropic has long been warning about these risks—so much so that in 2023, the company pledged to not release certain models ...

News-Medical.Net

Study reveals limitations of large language models in medical diagnostics

Artificial intelligence (AI) is rapidly transforming healthcare. AI systems can now detect diabetic eye disease from retinal photos and analyze CT images for signs of early-stage lung cancers and ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results