KEMBAR78
Comparing 4.1.1...4.2.0 · huggingface/datasets · GitHub
Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: huggingface/datasets
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: 4.1.1
Choose a base ref
...
head repository: huggingface/datasets
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: 4.2.0
Choose a head ref
  • 13 commits
  • 55 files changed
  • 7 contributors

Commits on Sep 18, 2025

  1. set dev version (#7782)

    lhoestq authored Sep 18, 2025
    Configuration menu
    Copy the full SHA
    b5b1ba0 View commit details
    Browse the repository at this point in the history

Commits on Sep 24, 2025

  1. Document HDF5 support (#7740)

    * init docs
    
    * update
    
    * Update loading_methods.mdx
    
    ---------
    
    Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
    klamike and lhoestq authored Sep 24, 2025
    Configuration menu
    Copy the full SHA
    5dc1a17 View commit details
    Browse the repository at this point in the history

Commits on Sep 25, 2025

  1. update tips in docs (#7790)

    lhoestq authored Sep 25, 2025
    Configuration menu
    Copy the full SHA
    27c2e70 View commit details
    Browse the repository at this point in the history

Commits on Sep 26, 2025

  1. feat: avoid some copies in torch formatter (#7787)

    * feat: avoid some copies in torch formatter
    
    * fix: handle kwargs
    
    * fix: run ruff
    
    * fix: handle dtype
    
    * fix: handle non writable np arrays
    
    * fix: remove comment map_nested
    
    * fix: adjust import for lint
    drbh authored Sep 26, 2025
    Configuration menu
    Copy the full SHA
    c412a6f View commit details
    Browse the repository at this point in the history

Commits on Oct 1, 2025

  1. Support huggingface_hub v0.x and v1.x (#7783)

    * Let's test like this
    
    * code quality
    
    * add back requests
    
    * install transformers from source
    
    * will it work?
    
    * to remove later: don't fail fast
    
    * don't fail fast
    
    * fix test fixture
    
    * fix OfflineModeIsEnabled test
    
    * huggingface_hub 1.0.0 even if deps latest
    
    * will be broken but better
    
    * pip list in CI
    
    * revert branch
    
    * install latest only in latest tests
    
    * offline
    
    * get back to normal
    
    * better
    
    * ofc
    
    * why not
    
    * as before
    
    * this time is good
    
    * fix yaml format
    
    * system
    
    * fix import in o.x
    
    * :/
    
    * Bump minimal version to 0.25.0
    
    * x-compatible offline helper
    
    * code quality
    
    * fix utils tests
    
    * fixing last bits
    
    * x-version compat
    
    * final commit
    Wauplin authored Oct 1, 2025
    Configuration menu
    Copy the full SHA
    a0c786c View commit details
    Browse the repository at this point in the history

Commits on Oct 7, 2025

  1. Define CI future (#7799)

    * define CI future
    
    * 3.12
    
    * fix tests
    lhoestq authored Oct 7, 2025
    Configuration menu
    Copy the full SHA
    095c7dc View commit details
    Browse the repository at this point in the history
  2. Sample without replacement option when interleaving datasets (#7786)

    * Sample without replacement option
    
    * Exit early for non arrow iterable.
    
    * Add new stopping strategy
    
    * Remove sample_with_replacement argument
    
    * fix CyclingMultiSourcesExamplesIterable.shard_data_sources
    
    * Add sampling without replacement logic for map style datasets.
    
    * Update process.mdx
    
    * Update stream.mdx
    
    ---------
    
    Co-authored-by: Petru Radulescu <petradul@amazon.com>
    Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>
    Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
    4 people authored Oct 7, 2025
    Configuration menu
    Copy the full SHA
    6f2502c View commit details
    Browse the repository at this point in the history

Commits on Oct 9, 2025

  1. Add parquet scan options and docs (#7801)

    parquet scan options and docs
    lhoestq authored Oct 9, 2025
    Configuration menu
    Copy the full SHA
    4e18df1 View commit details
    Browse the repository at this point in the history
  2. More Parquet streaming docs (#7803)

    * more parquet stream arg docs
    
    * minor
    
    * minor
    lhoestq authored Oct 9, 2025
    Configuration menu
    Copy the full SHA
    cfcdfce View commit details
    Browse the repository at this point in the history
  3. Less api calls when resolving data_files (#7805)

    less api calls when resolving data_files
    lhoestq authored Oct 9, 2025
    Configuration menu
    Copy the full SHA
    02ee330 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    5eec91a View commit details
    Browse the repository at this point in the history
  5. typo (#7807)

    lhoestq authored Oct 9, 2025
    Configuration menu
    Copy the full SHA
    fd8d287 View commit details
    Browse the repository at this point in the history
  6. release: 4.2.0 (#7808)

    lhoestq authored Oct 9, 2025
    Configuration menu
    Copy the full SHA
    7e1350b View commit details
    Browse the repository at this point in the history
Loading