[C++][Python] Add public APIs for reading and serializing IPC dictionary messages
This adds low-level APIs for working with IPC dictionary messages outside of the stream/file reader/writer context, enabling message-at-a-time IPC workflows with dictionary-encoded data.
C++ changes:
- Add public ReadDictionary(Message, DictionaryMemo*, IpcReadOptions) to read a single dictionary message into a memo
- Add CollectAndSerializeDictionaries(RecordBatch, DictionaryMemo*, IpcWriteOptions) to serialize dictionary messages with pointer-based deduplication
- Expose dictionary_memo() accessor on RecordBatchStreamReader and RecordBatchFileReader
- Refactor internal ReadDictionary to ReadDictionaryMessage in StreamDecoderInternal; make dictionary_memo_ protected
Python changes:
- Add ipc.read_dictionary_message() to populate a DictionaryMemo from a dictionary Message or Buffer
- Add RecordBatch.serialize_dictionaries() to serialize dictionary IPC messages with memo-based deduplication
- Add dictionary_memo property on RecordBatchStreamReader and RecordBatchFileReader
- Add DictionaryMemo.wrap() for non-owning references to reader memos
- Add read_dictionary_message to API docs
- Add tests covering all new APIs
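
Illustration of the deduplication idea referred to above. This is a minimal, stdlib-only sketch of pointer-based (identity-based) deduplication, not the Arrow implementation: `SketchMemo` and `collect_dictionary_messages` are hypothetical names invented for this example, and plain Python lists stand in for dictionary arrays. It assumes a dictionary is re-sent only when the object registered for a given dictionary id is not the same object as before; equal but distinct objects are treated as new.

```python
from typing import Any, Dict, List, Tuple

_MISSING = object()  # sentinel: no dictionary recorded yet for this id


class SketchMemo:
    """Hypothetical stand-in for a dictionary memo (not pyarrow.DictionaryMemo)."""

    def __init__(self) -> None:
        # Maps dictionary id -> the dictionary object most recently emitted.
        self._sent: Dict[int, Any] = {}

    def needs_send(self, dict_id: int, dictionary: Any) -> bool:
        # Identity comparison ("is"), mirroring a pointer comparison:
        # contents are never inspected.
        return self._sent.get(dict_id, _MISSING) is not dictionary

    def record(self, dict_id: int, dictionary: Any) -> None:
        self._sent[dict_id] = dictionary


def collect_dictionary_messages(
    batch_dictionaries: Dict[int, Any], memo: SketchMemo
) -> List[Tuple[int, Any]]:
    """Return (id, dictionary) pairs that still need to be emitted."""
    messages = []
    for dict_id, dictionary in batch_dictionaries.items():
        if memo.needs_send(dict_id, dictionary):
            messages.append((dict_id, dictionary))
            memo.record(dict_id, dictionary)
    return messages


colors = ["red", "green"]
memo = SketchMemo()

first = collect_dictionary_messages({0: colors}, memo)         # new id: emitted
second = collect_dictionary_messages({0: colors}, memo)        # same object: skipped
third = collect_dictionary_messages({0: list(colors)}, memo)   # equal copy: emitted again
```

Comparing by identity keeps the check constant-time per dictionary. The cost is that a dictionary rebuilt with identical contents is serialized again.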