tech.v3.libs.poi
Parse a dataset in xls or xlsx format. This namespace auto-registers a handler for
the xls
file type so that when using ->dataset, xls
will automatically map to
(first (workbook->datasets))
.
Note that this namespace does not auto-register a handler for the xlsx
file
type. xlsx
is handled by the fastexcel namespace.
If you have an xlsx
or xls
file that contains multiple sheets and you want a
dataset out of each sheet you have to use workbook->datasets
as opposed to the
higher level ->dataset
operator.
For serializing datasets to xlsx or xls formats please see kixi.large.
Note that poi has many versions and many version conflicts and for instance the docjure library relies on a much older version of poi.
input->workbook
(input->workbook input options)
(input->workbook input)
Given an input data source, return an implementation of
tech.v3.dataset/Spreadsheet$Workbook
. This interface allows you
to iterate through sheets without necessarily parsing them.
Once you have a spreadsheet, use tech.v3.dataset.io.spreadsheet/sheet->dataset
to get a dataset.
workbook->datasets
(workbook->datasets input options)
(workbook->datasets input)
Given a workbook, a string filename or an input stream return a sequence of datasets named after the sheets.
Options are a subset of the arguments for tech.v3.dataset/->dataset:
:file-type
- either:xls
or:xlsx
- inferred from the filename when input is a string. If input is a stream defaults to:xlsx
.:header-row?
:num-rows
:n-initial-skip-rows
:parser-fn