Jul 20, 2023

C++ library to use SIMD vectorization

Inastemp provides a set of C++ classes to make vectorization with intrinsics easier. It aims at developing numerical kernels by separating the algorithm from the hardware target. Inastemp comes with several examples and patterns related to widespread use-cases.


  • The following x86 SIMD types are currently supported
    • SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512-KNL, AVX512-SKL
  • The following Powere PC SIMD types are currently supported
    • Power-8 Altivec/VMX
  • arithmetic operators */+- are provided
  • CPU capacities are detected automatically during the CMake stage
  • The compiler capacities are detected automatically during the CMake stage
  • The library purely contains of headers, no linkage is necessary.
  • CPU detection may use Intel SDE
  • Unit-tests may use Intel SDE
  • Fast intrinsic exp function if not supported natively by the compiler
  • Explicit branches vectorization several patterns which represent many applications are demonstrated

Checkout these related ports:
  • Zziplib - Library to provide transparent read access to zipped files
  • Zydis - Fast and lightweight x86/x86-64 disassembler library
  • Zycore-c - Support library with platform independent types, macros, etc for Zydis
  • Zthread - Platform-independent object-oriented C++ threading library
  • Zookeeper - Coordination Service for Distributed Applications
  • Zls - Zig LSP implementation + Zig Language Server
  • Zfp - High throughput library for compressed floating-point arrays
  • Zeal - Offline documentation browser
  • Zapcc - C++ caching compiler based on clang
  • Zanata-platform - Web-based translation platform
  • Zanata-cli - Zanata Java command line client
  • Z88dk - Complete Z80/Z180 development kit
  • Z80ex - ZiLOG Z80 CPU emulator library
  • Z80asm - Assembler for the Z80 microprocessor
  • Z80-asm - Z80 assembly code assembler and disassembler