libcudf  24.02.00
Files | Classes | Functions
Writers

Files

file  csv.hpp
 
file  io/json.hpp
 
file  orc.hpp
 
file  parquet.hpp
 

Classes

class  cudf::io::csv_writer_options
 Settings to use for write_csv(). More...
 
class  cudf::io::csv_writer_options_builder
 Builder to build options for writer_csv() More...
 
class  cudf::io::json_writer_options
 Settings to use for write_json(). More...
 
class  cudf::io::json_writer_options_builder
 Builder to build options for writer_json() More...
 
class  cudf::io::orc_writer_options
 Settings to use for write_orc(). More...
 
class  cudf::io::orc_writer_options_builder
 Builds settings to use for write_orc(). More...
 
class  cudf::io::chunked_orc_writer_options
 Settings to use for write_orc_chunked(). More...
 
class  cudf::io::chunked_orc_writer_options_builder
 Builds settings to use for write_orc_chunked(). More...
 
class  cudf::io::orc_chunked_writer
 Chunked orc writer class writes an ORC file in a chunked/stream form. More...
 
class  cudf::io::parquet_writer_options
 Settings for write_parquet(). More...
 
class  cudf::io::parquet_writer_options_builder
 Class to build parquet_writer_options. More...
 
class  cudf::io::chunked_parquet_writer_options
 Settings for write_parquet_chunked(). More...
 
class  cudf::io::chunked_parquet_writer_options_builder
 Builds options for chunked_parquet_writer_options. More...
 
class  cudf::io::parquet_chunked_writer
 chunked parquet writer class to handle options and write tables in chunks. More...
 

Functions

void cudf::io::write_csv (csv_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream(), rmm::mr::device_memory_resource *mr=rmm::mr::get_current_device_resource())
 Writes a set of columns to CSV format. More...
 
void cudf::io::write_json (json_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream(), rmm::mr::device_memory_resource *mr=rmm::mr::get_current_device_resource())
 Writes a set of columns to JSON format. More...
 
void cudf::io::write_orc (orc_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream())
 Writes a set of columns to ORC format. More...
 
std::unique_ptr< std::vector< uint8_t > > cudf::io::write_parquet (parquet_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream())
 Writes a set of columns to parquet format. More...
 
std::unique_ptr< std::vector< uint8_t > > cudf::io::merge_row_group_metadata (std::vector< std::unique_ptr< std::vector< uint8_t >>> const &metadata_list)
 Merges multiple raw metadata blobs that were previously created by write_parquet into a single metadata blob. More...
 

Detailed Description

Function Documentation

◆ merge_row_group_metadata()

std::unique_ptr<std::vector<uint8_t> > cudf::io::merge_row_group_metadata ( std::vector< std::unique_ptr< std::vector< uint8_t >>> const &  metadata_list)

Merges multiple raw metadata blobs that were previously created by write_parquet into a single metadata blob.

Parameters
[in]metadata_listList of input file metadata
Returns
A parquet-compatible blob that contains the data for all row groups in the list

◆ write_csv()

Writes a set of columns to CSV format.

The following code snippet demonstrates how to write columns to a file:

auto destination = cudf::io::sink_info("dataset.csv");
auto options = cudf::io::csv_writer_options(destination, table->view())
.na_rep(na)
.include_header(include_header)
.rows_per_chunk(rows_per_chunk);
Settings to use for write_csv().
Definition: csv.hpp:1336
void write_csv(csv_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream(), rmm::mr::device_memory_resource *mr=rmm::mr::get_current_device_resource())
Writes a set of columns to CSV format.
Destination information for write interfaces.
Definition: io/types.hpp:469
Parameters
optionsSettings for controlling writing behavior
streamCUDA stream used for device memory operations and kernel launches
mrDevice memory resource to use for device memory allocation

◆ write_json()

Writes a set of columns to JSON format.

The following code snippet demonstrates how to write columns to a file:

auto destination = cudf::io::sink_info("dataset.json");
auto options = cudf::io::json_writer_options(destination, table->view())
.na_rep(na)
.lines(lines)
.rows_per_chunk(rows_per_chunk);
Settings to use for write_json().
Definition: io/json.hpp:601
void write_json(json_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream(), rmm::mr::device_memory_resource *mr=rmm::mr::get_current_device_resource())
Writes a set of columns to JSON format.
Parameters
optionsSettings for controlling writing behavior
streamCUDA stream used for device memory operations and kernel launches
mrDevice memory resource to use for device memory allocation

◆ write_orc()

void cudf::io::write_orc ( orc_writer_options const &  options,
rmm::cuda_stream_view  stream = cudf::get_default_stream() 
)

Writes a set of columns to ORC format.

The following code snippet demonstrates how to write columns to a file:

auto destination = cudf::io::sink_info("dataset.orc");
auto options = cudf::io::orc_writer_options::builder(destination, table->view());
static orc_writer_options_builder builder(sink_info const &sink, table_view const &table)
Create builder to create orc_writer_options.
void write_orc(orc_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream())
Writes a set of columns to ORC format.
Parameters
optionsSettings for controlling reading behavior
streamCUDA stream used for device memory operations and kernel launches

◆ write_parquet()

std::unique_ptr<std::vector<uint8_t> > cudf::io::write_parquet ( parquet_writer_options const &  options,
rmm::cuda_stream_view  stream = cudf::get_default_stream() 
)

Writes a set of columns to parquet format.

The following code snippet demonstrates how to write columns to a file:

auto destination = cudf::io::sink_info("dataset.parquet");
auto options = cudf::io::parquet_writer_options::builder(destination, table->view());
static parquet_writer_options_builder builder()
Create builder to create parquet_writer_options.
std::unique_ptr< std::vector< uint8_t > > write_parquet(parquet_writer_options const &options, rmm::cuda_stream_view stream=cudf::get_default_stream())
Writes a set of columns to parquet format.
Parameters
optionsSettings for controlling writing behavior
streamCUDA stream used for device memory operations and kernel launches
Returns
A blob that contains the file metadata (parquet FileMetadata thrift message) if requested in parquet_writer_options (empty blob otherwise).