Anemll
@anemll
ANEMLL (pronounced like "animal") Artificial Neural Engine Machine Learning Library, Open Source Project
If every post-2020 Apple device lit up its Neural Engine at once, humanity would have ~20 zettainteger-ops-per-second of on-device AI oomph—about five times the cumulative floating-point tensor capacity of all NVIDIA GPUs sold in the same period. In practice, Nvidia’s…
MLP from dequantized int8 with CoreML 9.x test / example source code gist.github.com/Anemll/a9838e1…



I think it should be CoreML 26 🤣
Incoming new coremltools looks like it has some nice bits: - 8 bit input/output tensors (previously all 8bit compute was kept internal) - >1 input can be enumerated shapes (👀ANE)
Sure, here's a sample run: Prompt: 3135 tokens, 249.467 tokens-per-sec Generation: 100 tokens, 24.264 tokens-per-sec Peak memory: 272.008 GB
It’s amazing how CoreML visualization looks like circuit / FPGA design ! Here is AltUp representation for Gemma 3n. It would be cool to see/edit a graph representation alongside PyTorch or MLX code during development or debugging 🤔
