Skip to main content

Check out our projects available for funding!

Jupyter ecosystem

  • Modernizing nbconvert
    Conversion of Jupyter notebooks to PDF currently relies on nbconvert in the backend, which in turns uses a headless browser for producing the PDF. We propose to directly perform the PDF conversion in the user's browser, which will simplify the archit...
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    30 000 €
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet
  • Raster processing tools in JupyterGIS
    JupyterGIS currently offers a set of vector processing and conversion tools. These capabilities are powered by a GDAL WebAssembly (WASM) build running in the browser. We will work on extending support to raster processing tools using the same underly...
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    TBD
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet
  • Bringing processing tools to the JupyterGIS Python API
    JupyterGIS currently supports several vector processing and conversion tools, currently available only through the JupyterGIS user interface. We plan to extend these capabilities to the JupyterGIS Python API, enabling users to access the same process...
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    10 000 €
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet

Package management

  • Package requests for emscripten-forge
    Emscripten-forge is a conda package distribution specifically designed for WebAssembly. While the number of available emscripten-forge packages is growing quickly, many packages are still missing from the ecosystem. We will be working on adding new p...
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    TBD
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet

Scientific computing

  • SVE2 support in xsimd
    xsimd is a C++ scientific library that abstract low-level high performances computing primitives across different hardwares. We will add support for the latest SVE2 generation of ARM CPUs.
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    30 000 €
    Indicative price
    Funded at 0 %
    Shareable between 2 funders
    This project does not have any backers yet
  • Implementing Kazushige Goto Algorithms for Matrix Operations in xtensor
    This project aims to integrate Kazushige Goto’s highly optimized matrix multiplication algorithms into the xtensor framework, leveraging the xsimd library for SIMD acceleration.
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    45 000 €
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet

Apache Arrow and Parquet

  • Complete BinaryView / StringView support in Arrow C++
    BinaryView is a more recent and more efficient alternative to Arrow's standard Binary type. It allows for inlined storage of short strings and fast prefix comparison.
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    TBD
    Indicative price
    Funded at 0 %
    Shareable between 4 funders
    This project does not have any backers yet
  • Complete Decimal32 / Decimal64 support in Arrow C++
    Decimal32 and Decimal64 are more compact and computationally more efficient data types than the standard Decimal128.
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    TBD
    Indicative price
    Funded at 0 %
    Shareable between 2 funders
    This project does not have any backers yet

Can't find a project?

If you have a project in mind that you think would be relevant to our expertise, please contact us to discuss it.

Check out our projects available for funding!

Jupyter ecosystem

  • Modernizing nbconvert
    Conversion of Jupyter notebooks to PDF currently relies on nbconvert in the backend, which in turns uses a headless browser for producing the PDF. We propose to directly perform the PDF conversion in the user's browser, which will simplify the archit...
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    30 000 €
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet
  • Raster processing tools in JupyterGIS
    JupyterGIS currently offers a set of vector processing and conversion tools. These capabilities are powered by a GDAL WebAssembly (WASM) build running in the browser. We will work on extending support to raster processing tools using the same underly...
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    TBD
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet
  • Bringing processing tools to the JupyterGIS Python API
    JupyterGIS currently supports several vector processing and conversion tools, currently available only through the JupyterGIS user interface. We plan to extend these capabilities to the JupyterGIS Python API, enabling users to access the same process...
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    10 000 €
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet

Package management

  • Package requests for emscripten-forge
    Emscripten-forge is a conda package distribution specifically designed for WebAssembly. While the number of available emscripten-forge packages is growing quickly, many packages are still missing from the ecosystem. We will be working on adding new p...
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    TBD
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet

Scientific computing

  • SVE2 support in xsimd
    xsimd is a C++ scientific library that abstract low-level high performances computing primitives across different hardwares. We will add support for the latest SVE2 generation of ARM CPUs.
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    30 000 €
    Indicative price
    Funded at 0 %
    Shareable between 2 funders
    This project does not have any backers yet
  • Implementing Kazushige Goto Algorithms for Matrix Operations in xtensor
    This project aims to integrate Kazushige Goto’s highly optimized matrix multiplication algorithms into the xtensor framework, leveraging the xsimd library for SIMD acceleration.
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    45 000 €
    Indicative price
    Funded at 0 %
    Not shareable between funders
    This project does not have any backers yet

Apache Arrow and Parquet

  • Complete BinaryView / StringView support in Arrow C++
    BinaryView is a more recent and more efficient alternative to Arrow's standard Binary type. It allows for inlined storage of short strings and fast prefix comparison.
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    TBD
    Indicative price
    Funded at 0 %
    Shareable between 4 funders
    This project does not have any backers yet
  • Complete Decimal32 / Decimal64 support in Arrow C++
    Decimal32 and Decimal64 are more compact and computationally more efficient data types than the standard Decimal128.
    Open-source under relevant licenses
    The funding organization will be credited in communication about the project
    Note: Costs and features can be further adapted following discussion with the funding organization
    TBD
    Indicative price
    Funded at 0 %
    Shareable between 2 funders
    This project does not have any backers yet

Can't find a project?

If you have a project in mind that you think would be relevant to our expertise, please contact us to discuss it.

Complete BinaryView / StringView support in Arrow C++

Overview

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics.

Representation of string and binary data in Arrow traditionally uses the Binary layout, where the entire string data resides in a separate buffer that is accessed using indirect indexing from a buffer of offsets.

Recently, the Arrow project added the Binary View layout, a more efficient layout inspired from modern execution engines where the beginning of each string is packed directly within the offsets buffer. This allows short strings to be read and processed directly without going through an additional indirection.

However, while basic support is present, Binary View is not universally supported by all Arrow components.

We propose to finish implementing support for Binary View and String View types in all components of Arrow C++:

  • scalar compute kernels:

    • equal, less_equal, etc.
    • is_in, index_in
    • ascii_*, binary_*, utf8_*
    • string_is_ascii
    • count_substring
    • extract_regex, extract_regex_span
    • split_pattern, split_pattern_regex
    • coalesce
  • vector compute kernels:

    • take, filter, scatter
    • run_end_encode, run_end_decode
    • sort_indices, rank, rank_normal, rank_quantile
    • partition_nth_indices
    • select_k_unstable
    • replace_with_mask
    • fill_null_forward, fill_null_backward, drop_null
  • aggregate compute kernels:

    • count_distinct
    • first, last, min, max
    • index
  • CSV reader and writer

  • ORC reader and writer

Funders can decide to fund the entire package, or choose the components they are interested in.

Are you interested in this project? Either entirely or partially, contact us for more information on how to help us fund it