Posts 305
Comments 2.9K

Lebanese Rule

4

Skill issue

13

Creak creak

16

Spread 'em

11
"The **Most open** Operating System"
  • At least the Aptos default typeface they use is better than Calibri. I hate Calibri but I can't quite articulate why. Maybe because it's the default and I'll hate Aptos eventually too.

  • Irrational numbers are an infinite source of information.
  • Situationally, yes. "I want the next digit of pi" is information in that sense of the word. It's not a particularly useful piece of information unless you're building something that requires a circle with a circumferential precision larger than the width of our entire universe.

  • Coffee Rule

    Also hot dog water

    75

    Tesla Cybertruck Owners Discover New Weakness: Magnets

    lol

    6

    Time for a road trip

    56

    "I never asked for this"

    67

    Quite an explosive finish

    116

    We have the best shaped states, don't we folks

    9

    Beer, I summon thee

    23

    Radio Dial Rule

    95

    Archive link

    lol what a shit vehicle

    1

    Space pirates' ultimate version of walking the plank would be sending their victims into a black hole

    Passing the event horizon, spaghettification, and certain death await. Yarrr.

    16

    The election is over so here's the happily ever after

    It's so horrible and the large black bars make it worse lol

    14
    Microblog Memes @lemmy.world Gork @lemm.ee

    I thought they'd be fapping to Dear Leader

    79

    FFmpeg devs boast of up to 94x performance boost after implementing handwritten AVX-512 assembly code

    Contemporary high-level programming languages and advanced compilers greatly simplify software development and lower its costs. However, this way of programming can hide the performance capabilities of modern hardware, partly due to inefficiencies of application programming interfaces (APIs). Apparently, a good old assembly code path can improve performance by a factor of three to 94, depending on the workload, according to FFmpeg. The hardware on which these gains were measured was not disclosed.

    FFmpeg is an open-source video decoding project developed by volunteers who contribute to its codebase, fix bugs, and add new features. The project is led by a small group of core developers and maintainers who oversee its direction and ensure that contributions meet certain standards. They coordinate the project's development and release cycles, merging contributions from other developers. This group of developers tried to implement a handwritten AVX-512 assembly code path, something that has rarely been done before, at least in the video industry.

    The developers have created an optimized code path using the AVX-512 instruction set to accelerate specific functions within the FFmpeg multimedia processing library. By leveraging AVX-512, they were able to achieve significant performance improvements, from three to 94 times faster, compared to standard implementations. AVX-512 enables processing large chunks of data in parallel using 512-bit registers, each of which can hold up to 16 single-precision or 8 double-precision floating-point values and operate on all of them in one instruction. This optimization is well suited to compute-heavy tasks in general, and to video and image processing in particular.
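The 16 and 8 figures quoted above follow from dividing the 512-bit register width by the element width. Here is a minimal Python sketch of that arithmetic. The function names and the 1920-element row are illustrative assumptions, not anything from FFmpeg, and the iteration count is an idealized upper bound on the benefit, not a measured speedup.

```python
def lanes_per_register(register_bits: int, element_bits: int) -> int:
    """Return how many elements of a given width fit in one SIMD register."""
    if element_bits <= 0 or register_bits % element_bits != 0:
        raise ValueError("element width must evenly divide register width")
    return register_bits // element_bits


def vector_iterations(n_elements: int, lanes: int) -> int:
    """Return loop iterations needed if each iteration handles `lanes` elements.

    Uses ceiling division, so a partial final vector still counts as one pass.
    """
    return -(-n_elements // lanes)


AVX512_BITS = 512

f32_lanes = lanes_per_register(AVX512_BITS, 32)  # single-precision floats
f64_lanes = lanes_per_register(AVX512_BITS, 64)  # double-precision floats

# Hypothetical example: one 1920-element row of single-precision samples.
row = 1920
scalar_passes = vector_iterations(row, 1)
avx512_passes = vector_iterations(row, f32_lanes)

print(f32_lanes, f64_lanes, scalar_passes, avx512_passes)
```

Note that a 16x reduction in loop passes does not by itself explain a 94x result. Speedups of that size are measured against baseline C code and usually also reflect narrower integer element types (more lanes per register) and instructions that replace several scalar operations each.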

    The benchmarking results show that the new handwritten AVX-512 code path performs considerably faster than other implementations, including baseline C code and narrower SIMD instruction sets like AVX2 and SSE3. In some cases, the revamped AVX-512 code path achieves a speedup of nearly 94 times over the baseline, highlighting the efficiency of hand-optimized assembly code for AVX-512.

    This development is particularly valuable for users running on high-performance, AVX-512-capable hardware, enabling them to process media content far more efficiently. There is an issue, though: Intel disabled AVX-512 on its 12th, 13th, and 14th Generation Core processors, leaving owners of these CPUs without it. On the other hand, AMD's Ryzen 9000-series CPUs feature a fully enabled AVX-512 FPU, so the owners of these processors can take advantage of the FFmpeg achievement.

    Unfortunately, due to the complexity and specialized nature of AVX-512, such optimizations are typically reserved for performance-critical applications and require expertise in low-level programming and processor microarchitecture.

    9

    New mobile features are sh*t these days

    110

    Blessicrule

    24

    Rock and Stone

    30