maderix/ANE — Training neural networks on Apple Neural Engine via reverse-engineered private APIs
Developer Manjeet Singh reverse-engineered Apple's undocumented _ANEClient and _ANECompiler private APIs to enable full transformer training — forward pass, backward pass, attention, gradients, and optimizer — running directly on Apple's Neural Engine. No CoreML. No Metal. No GPU. Pure ANE compute.
Achieves 9.3ms per training step on M4 silicon and 6.6 TFLOPS/W power efficiency. Builds with a single clang command and zero external dependencies (macOS system frameworks only).
exec(). Not production-ready without significant hardening.
Zero external dependencies — macOS system frameworks only. Single maintainer (maderix). No published security policy. USE_WITH_CAUTION due to explicit instability constraints (resource leak, ~119 compilation limit, exec() self-restart workaround).
ODS runs on Linux/GCP — ANE hardware is not directly applicable to server-side services. However, two strategic angles merit tracking:
Not actionable for ODS server-side services today. Monitor for maturation into a production-usable framework before considering DocSign integration.