Apache Arrow是一种用于内存数据的跨语言开发平台。它为平面和分层数据指定了标准化的独立于语言的柱状存储器格式,可在现代硬件上进行有效的分析操作。它还提供计算库和零拷贝流消息传递和进程间通信。目前支持的语言包括C,C ++,Java,JavaScript,Python和Ruby。
Apache Arrow is a universal columnar format and multi-language toolbox for fast
data interchange and in-memory analytics. It contains a set of technologies that
enable data systems to efficiently store, process, and move data.
Major components of the project include:
↗
: Arrow-powered API,↗
↗
↗
↗
↗
↗
The ↗
icon denotes that this component of the project is maintained in a separate
repository.
Arrow is an Apache Software Foundation project. Learn more at
arrow.apache.org.
The reference Arrow libraries contain many distinct software components:
The official Arrow libraries in this repository are in different stages of
implementing the Arrow format and related features. See our current
feature matrix
on git main.
Please read our latest project contribution guide.
Even if you do not plan to contribute to Apache Arrow itself or Arrow
integrations in other projects, we’d be happy to have you involved: