The GNU libc atanh is correctly rounded

jcranmer · 2026-04-18T00:31:56.000Z 1776472316

One of the major projects that's ongoing in the current decade is moving the standard math library functions to fully correctly-rounded, as opposed to the traditional accuracy target of ~1 ULP (the last bit is off).

For single-precision unary functions, it's easy enough to just exhaustively test every single input (there's only 4 billion of them). But double precision has prohibitively many inputs to test, so you have to resort to actual proof techniques to prove correct rounding for double-precision functions.

WalterGR · 2026-04-18T05:22:54.000Z 1776489774

> traditional accuracy target of ~1 ULP

I had to google this one…

ULP: “Unit in the Last Place” or “Unit of Least Precision: https://en.wikipedia.org/wiki/Unit_in_the_last_place

mananaysiempre · 2026-04-18T07:27:40.000Z 1776497260

For what it’s worth, this is basically the first word you learn when discussing numerical precision; and I mean word—nobody thinks of it as an abbreviation, to the point that it’s very often written in lower case. So welcome to the club.

adgjlsfhk1 · 2026-04-18T05:47:29.000Z 1776491249

to me this feels like wasted effort due to solving the wrong problem. The extra half ulp error makes no difference to the accuracy of calculations. the problem is that languages traditionally rely on an OS provided libm leading to cross architecture differences. If instead, languages use a specific libm, all of these problems vanish.

SideQuark · 2026-04-18T11:01:09.000Z 1776510069

> The extra half ulp error makes no difference to the accuracy of calculations

It absolutely does matter. The first, and most important reason, is one needs to know the guarantees of every operation in order to design numerical algorithms that meet some guarantee. Without knowing that the components provide, it's impossible to design algorithms on top with some guarantee. And this is needed in a massive amount of applications, from CAD, simulation, medical and financial items, control items, aerospace, and on and on.

And once one has a guarantee, making the lower components tighter allows higher components to do less work. This is a very low level component, so putting the guarantees there reduces work for tons of downstream work.

All this is precisely what drove IEEE 754 to become a thing and to become the standard in modern hardware.

> the problem is that languages traditionally rely on an OS provided libm leading to cross architecture differences

No, they don't not things like sqrt and atanh and related. They've relied on compiler provided libs since, well, as long as there have been languages. And the higher level libs, like BLAS, are built on specific compilers that provide guarantees by, again, libs the compiler used. I've not seen OS level calls describing the accuracy of the floating point items, but a lot of languages do, including C/C++ which underlies a lot of this code.

lifthrasiir · 2026-04-18T05:57:01.000Z 1776491821

Standardizing a particular libm essentially locks any further optimizations because that libm's implementation quirks have to be exactly followed. In comparison the "most correct" (0.5 ulp) answer is easy to standardize and agree upon.

ghighi7878 · 2026-04-18T10:07:52.000Z 1776506872

Mixed precision computations need correctly rounded functions.

fweimer · 2026-04-18T06:31:13.000Z 1776493873

Many of the conversions so far have been clearly faster. I don't think anything has been merged which shows a clear performance regression, at least not on CPUs with FMA support.

gajjanag · 2026-04-18T11:41:39.000Z 1776512499

The bigger challenge is GPU/NPU. Branches for fast vs accurate path get costlier, among other things. On CPU this is less of a cost.

Most published libm on GPU/NPU side have a few ULP of error for the perf vs accuracy tradeoff. Eg, documented explicitly in the CUDA programming guide: https://docs.nvidia.com/cuda/cuda-programming-guide/05-appen... .

Prof. Zimmermann and collaborators have a great table at https://members.loria.fr/PZimmermann/papers/accuracy.pdf (Feb 2026) comparing various libm wrt accuracy.

RyJones · 2026-04-18T01:26:53.000Z 1776475613

Interesting: https://youtu.be/cb5r3r38O9c

Guy's world records get deleted due to changes in atanh over time

ncruces · 2026-04-18T08:24:08.000Z 1776500648

Pretty sure that's atan.

im3w1l · 2026-04-18T03:43:48.000Z 1776483828

As that's a pretty long video would you mind giving a short summary of what happened? Was it a world record in a game?

dgaudet · 2026-04-18T04:31:01.000Z 1776486661

yeah one of the trackmania games -- which feature a nominally deterministic physics engine, allowing for replays from a recorded sequence of inputs... except the physics engine relies on libc transcendental functions. players are generally on windows, but backend servers doing anti-cheat validations via replays are running linux. this resulted in false cheat positives when the linux server was running glibc prior to the glibc rounding fixes... and as a result the guy's world record kept being flagged as a cheat. it's a pretty good video with a lot of detail on how they narrowed it down to specific glibc versions/etc.

kergonath · 2026-04-17T23:10:52.000Z 1776467452

I don’t think I ever used atanh, but I always love some floating-point nerdery. These other documents by the same team are fantastic resources: https://inria.hal.science/hal-04714173v2/document for complex values and https://members.loria.fr/PZimmermann/papers/accuracy.pdf for real values.

Lots of good stuff here: https://members.loria.fr/PZimmermann/papers/ .

nmbrskeptix · 2026-04-17T23:19:55.000Z 1776467995

Tanh, and therefore atanh, are wonderful.

It's linear for small x, and exponential for large. Lots of applications of this:

Compressing data

Mapping (zoomed in near by, zoned out from afar)

There's a whole class of electronics amps for this.

cyco130 · 2026-04-18T10:10:53.000Z 1776507053

tanh is a very pleasant sounding overdrive function for audio, for example.

RandomTeaParty · 2026-04-18T01:35:04.000Z 1776476104

Why not arxiv?

TimorousBestie · 2026-04-18T01:58:23.000Z 1776477503

The author works at a French university. Some French researchers do choose to cross-post to arXiv (and Zimmermann may have too, I haven’t checked), but HAL is the default.

mananaysiempre · 2026-04-18T08:48:05.000Z 1776502085

> HAL is the default.

HAL is an institutional requirement, I believe.

jonathrg · 2026-04-17T23:08:32.000Z 1776467312

Good to know!

brcmthrowaway · 2026-04-18T01:18:33.000Z 1776475113

Who wrote it? Someone at Red Hat likely.

stephencanon · 2026-04-18T04:11:56.000Z 1776485516

The CORE-MATH project authors, most of whom are French academics (including the author of the linked paper).

I don’t know of any interesting work in this space that came out of Red Hat, why do you suggest them?

DiabloD3 · 2026-04-18T04:11:13.000Z 1776485473

https://en.wikipedia.org/wiki/Paul_Zimmermann_(mathematician...

He doesn't work for Red Hat.

ameliaquining · 2026-04-18T04:20:04.000Z 1776486004

As the paper mentions, this particular routine was the work of Alexei Sibidanov, though Zimmermann seems to have been maintaining it since it was contributed. (Sibidanov doesn't work for Red Hat either, though.)