项目作者: apache

项目描述 :
Apache Arrow是一种用于内存数据的跨语言开发平台。它为平面和分层数据指定了标准化的独立于语言的柱状存储器格式,可在现代硬件上进行有效的分析操作。它还提供计算库和零拷贝流消息传递和进程间通信。目前支持的语言包括C,C ++,Java,JavaScript,Python和Ruby。
高级语言: C++
项目地址: git://github.com/apache/arrow.git
创建时间: 2016-02-17T08:00:23Z
项目社区:https://github.com/apache/arrow

开源协议:Apache License 2.0

下载


Apache Arrow

Fuzzing Status
License
BlueSky Follow

Powering In-Memory Analytics

Apache Arrow is a universal columnar format and multi-language toolbox for fast
data interchange and in-memory analytics. It contains a set of technologies that
enable data systems to efficiently store, process, and move data.

Major components of the project include:

The icon denotes that this component of the project is maintained in a separate
repository.

Arrow is an Apache Software Foundation project. Learn more at
arrow.apache.org.

What’s in the Arrow libraries?

The reference Arrow libraries contain many distinct software components:

  • Columnar vector and table-like containers (similar to data frames) supporting
    flat or nested types
  • Fast, language agnostic metadata messaging layer (using Google’s Flatbuffers
    library)
  • Reference-counted off-heap buffer memory management, for zero-copy memory
    sharing and handling memory-mapped files
  • IO interfaces to local and remote filesystems
  • Self-describing binary wire formats (streaming and batch/file-like) for
    remote procedure calls (RPC) and interprocess communication (IPC)
  • Integration tests for verifying binary compatibility between the
    implementations (e.g. sending data from Java to C++)
  • Conversions to and from other in-memory data structures
  • Readers and writers for various widely-used file formats (such as Parquet, CSV)

Implementation status

The official Arrow libraries in this repository are in different stages of
implementing the Arrow format and related features. See our current
feature matrix
on git main.

How to Contribute

Please read our latest project contribution guide.

Getting involved

Even if you do not plan to contribute to Apache Arrow itself or Arrow
integrations in other projects, we’d be happy to have you involved: