Py-pydoop

Jul 20, 2023

Python interface to Hadoop

Pydoop is a Python interface to Hadoop that allows you to write MapReduce applications in pure Python.

Pydoop offers several features not commonly found in other Python libraries for Hadoop:

  • a rich HDFS API
  • a MapReduce API that lets you write pure Python record readers and writers, partitioners, and combiners
  • transparent Avro deserialization
  • easy installation-free usage
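To give a feel for the MapReduce API mentioned above, here is a minimal word count sketch. The `pydoop.mapreduce.api` and `pydoop.mapreduce.pipes` modules, the `Mapper`/`Reducer` base classes, and `pipes.run_task` follow the upstream Pydoop documentation as best understood; `map_words`, `reduce_counts`, and `FakeContext` are hypothetical helpers introduced here so the logic can be tried locally without a Hadoop cluster. Details may differ between Pydoop versions.

```python
"""Word count sketch in the style of Pydoop's MapReduce API."""

try:
    import pydoop.mapreduce.api as api
    import pydoop.mapreduce.pipes as pipes
    HAVE_PYDOOP = True
except ImportError:
    # Pydoop is not installed: the plain functions below still work locally.
    HAVE_PYDOOP = False


def map_words(context):
    """Emit (word, 1) for every whitespace-separated word in the input line."""
    for word in context.value.split():
        context.emit(word, 1)


def reduce_counts(context):
    """Sum all counts that were emitted for the current key."""
    context.emit(context.key, sum(context.values))


class FakeContext:
    """Hypothetical stand-in for a task context, for local experiments only."""

    def __init__(self, key=None, value=None, values=()):
        self.key = key
        self.value = value
        self.values = values
        self.emitted = []

    def emit(self, key, value):
        # Record the pair instead of sending it to Hadoop.
        self.emitted.append((key, value))


if HAVE_PYDOOP:
    class Mapper(api.Mapper):
        def map(self, context):
            map_words(context)

    class Reducer(api.Reducer):
        def reduce(self, context):
            reduce_counts(context)

    def __main__():
        # Assumed entry point, as used in the upstream word count example.
        pipes.run_task(pipes.Factory(Mapper, reducer_class=Reducer))
```

Without a cluster, the same logic can be exercised by passing a `FakeContext(value="some input line")` to `map_words` and inspecting its `emitted` list.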

WWW: http://crs4.github.io/pydoop/



Check out these related ports:
  • Zziplib - Library to provide transparent read access to zipped files
  • Zydis - Fast and lightweight x86/x86-64 disassembler library
  • Zycore-c - Support library with platform independent types, macros, etc for Zydis
  • Zthread - Platform-independent object-oriented C++ threading library
  • Zookeeper - Coordination Service for Distributed Applications
  • Zls - Zig LSP implementation + Zig Language Server
  • Zfp - High throughput library for compressed floating-point arrays
  • Zeal - Offline documentation browser
  • Zapcc - C++ caching compiler based on clang
  • Zanata-platform - Web-based translation platform
  • Zanata-cli - Zanata Java command line client
  • Z88dk - Complete Z80/Z180 development kit
  • Z80ex - ZiLOG Z80 CPU emulator library
  • Z80asm - Assembler for the Z80 microprocessor
  • Z80-asm - Z80 assembly code assembler and disassembler