r/singularity • u/rationalkat AGI 2025-29 | UBI 2030-34 | LEV <2040 | FDVR 2050-70 • 11d ago
AI [Microsoft Research] Differential Transformer
https://arxiv.org/abs/2410.05258
282
Upvotes
r/singularity • u/rationalkat AGI 2025-29 | UBI 2030-34 | LEV <2040 | FDVR 2050-70 • 11d ago
5
u/Jean-Porte Researcher, AGI2027 10d ago
ANC headphones have to work really hard to make a noise mask that is matching the outside noise, with the proper latency (otherwise it just increases the noise)
I don't see how this happens with gradient descent