Tags: web-review, benchmarking, cpu, foss, gpt, python, game, criticism, filesystem, management, strategy, machine-learning, tech, programming, science, social-media, tools, art, hardware, safety, 3d, networking, research, data, distributed, vulkan, portability, vision, reliability, data-visualization, security, data-science, ai, cryptography, history
Let’s go for my web review for the week 2024-23.
Tags: tech, ai, gpt, machine-learning, safety, research
Another cruel reminder that basic reasoning is not to be expected from LLMs. Here is a quote from the conclusion of the paper which makes it clear:
“We think that observations made in our study should serve as strong reminder that current SOTA LLMs are not capable of sound, consistent reasoning, as shown here by their breakdown on even such a simple task as the presented AIW problem, and enabling such reasoning is still subject of basic research. This should be also a strong warning against overblown claims for such models beyond being basic research artifacts to serve as problem solvers in various real world settings, which are often made by different commercial entities in attempt to position their models as a strong mature product for end-users. […] Observed breakdown of basic reasoning capabilities, coupled with such public claims (which are also based on standardized benchmarks), present an inherent safety problem. Models with insufficient basic reasoning are inherently unsafe, as they will produce wrong decisions in various important scenarios that do require intact reasoning.”
https://arxiv.org/abs/2406.02061
Tags: tech, social-media, art, criticism
Interesting critique of this new platform… it’s the beginning of the hype cycle but will probably exhibit the same decay phenomenon than other platforms.
Tags: tech, foss, security, data, data-science
The more releases out there the more vulnerabilities are (and could be) discovered. Some actions are necessary to get things under control properly.
https://opensourcesecurity.io/2024/06/03/why-are-vulnerabilities-out-of-control-in-2024/
Tags: tech, networking, reliability
A good reminder of everything which might go wrong when connectivity is bad. Most tools let you down in such a case.
https://brr.fyi/posts/engineering-for-slow-internet
Tags: tech, hardware, cpu
Very nice explanation and metaphors on how CPUs cache levels work.
https://fgiesen.wordpress.com/2016/08/07/why-do-cpus-have-multiple-cache-levels/
Tags: tech, benchmarking, tools
Looks like an interesting benchmarking tool. To keep an eye on.
https://github.com/sosy-lab/benchexec
Tags: tech, programming, python
Definitely a nice Python trick. Fairly elegant, I’ll try to remember it.
https://mathspp.com/blog/til/order-values-of-dictionary-by-iterable-of-keys-with-operator.itemgetter
Tags: tech, filesystem, distributed, safety, cryptography
Interesting use of cryptography without a security concern. It’s more about safety and ensuring something wasn’t missed by mistake.
https://mazzo.li/posts/mac-distributed-tx.html
Tags: tech, 3d, vulkan, portability
The difficult path for Vulkan. The data obviously is biased since it includes games and most of them are still targeting Windows and so DirectX. I’d be curious to see something similar excluding games (and so focusing on medical, industrial etc.).
https://www.carette.xyz/posts/state_of_vulkan_2024/
Tags: tech, 3d, game, vulkan
Interesting dive into the experience of writing a small Vulkan engine (almost) from scratch.
https://edw.is/learning-vulkan/
Tags: tech, management, strategy, vision
Packed with useful information. Clearly some things I’m eager to test in there.
https://blog.practicalengineering.management/how-to-build-engineering-strategy-42e464018177
Tags: science, history, data-visualization
So yes, the climate changed before… now slowly scroll until the end to appreciate how brutal it is this time.
Bye for now!