nushell

mirror of https://github.com/nushell/nushell.git synced 2025-08-02 06:33:00 +02:00

Author	SHA1	Message	Date
YizhePKU	f38f88d42c	Fixes `.` expanded incorrectly as external argument (#12950 ) This PR fixes a bug where `.` is expanded into an empty string when used as an argument to external commands. Fixes https://github.com/nushell/nushell/issues/12948. --------- Co-authored-by: Ian Manske <ian.manske@pm.me>	2024-05-26 07:06:17 +08:00
Darren Schroeder	0c5a67f4e5	make polars plugin use mimalloc (#12967 ) # Description @maxim-uvarov did a ton of research and work with the dply-rs author and ritchie from polars and found out that the allocator matters on macos and it seems to be what was messing up the performance of polars plugin. ritchie suggested to use jemalloc but i switched it to mimalloc to match nushell and it seems to run better. ## Before (default allocator) note - using 1..10 vs 1..100 since it takes so long. also notice how high the `max` timings are compared to mimalloc below. ```nushell ❯ 1..10 \| each {timeit {polars open Data7602DescendingYearOrder.csv \| polars group-by year \| polars agg (polars col geo_count \| polars sum) \| polars collect \| null}} \| \| {mean: ($in \| math avg), min: ($in \| math min), max: ($in \| math max), stddev: ($in \| into int \| into float \| math stddev \| into int \| $'($in)ns' \| into duration)} ╭────────┬─────────────────────────╮ │ mean │ 4sec 999ms 605µs 995ns │ │ min │ 983ms 627µs 42ns │ │ max │ 13sec 398ms 135µs 791ns │ │ stddev │ 3sec 476ms 479µs 939ns │ ╰────────┴─────────────────────────╯ ❯ use std bench ❯ bench { polars open Data7602DescendingYearOrder.csv \| polars group-by year \| polars agg (polars col geo_count \| polars sum) \| polars collect \| null } -n 10 ╭───────┬────────────────────────╮ │ mean │ 6sec 220ms 783µs 983ns │ │ min │ 1sec 184ms 997µs 708ns │ │ max │ 18sec 882ms 81µs 708ns │ │ std │ 5sec 350ms 375µs 697ns │ │ times │ [list 10 items] │ ╰───────┴────────────────────────╯ ``` ## After (using mimalloc) ```nushell ❯ 1..100 \| each {timeit {polars open Data7602DescendingYearOrder.csv \| polars group-by year \| polars agg (polars col geo_count \| polars sum) \| polars collect \| null}} \| \| {mean: ($in \| math avg), min: ($in \| math min), max: ($in \| math max), stddev: ($in \| into int \| into float \| math stddev \| into int \| $'($in)ns' \| into duration)} ╭────────┬───────────────────╮ │ mean │ 103ms 728µs 902ns │ │ min 
│ 97ms 107µs 42ns │ │ max │ 149ms 430µs 84ns │ │ stddev │ 5ms 690µs 664ns │ ╰────────┴───────────────────╯ ❯ use std bench ❯ bench { polars open Data7602DescendingYearOrder.csv \| polars group-by year \| polars agg (polars col geo_count \| polars sum) \| polars collect \| null } -n 100 ╭───────┬───────────────────╮ │ mean │ 103ms 620µs 195ns │ │ min │ 97ms 541µs 166ns │ │ max │ 130ms 262µs 166ns │ │ std │ 4ms 948µs 654ns │ │ times │ [list 100 items] │ ╰───────┴───────────────────╯ ``` ## After (using jemalloc - just for comparison) ```nushell ❯ 1..100 \| each {timeit {polars open Data7602DescendingYearOrder.csv \| polars group-by year \| polars agg (polars col geo_count \| polars sum) \| polars collect \| null}} \| \| {mean: ($in \| math avg), min: ($in \| math min), max: ($in \| math max), stddev: ($in \| into int \| into float \| math stddev \| into int \| $'($in)ns' \| into duration)} ╭────────┬───────────────────╮ │ mean │ 113ms 939µs 777ns │ │ min │ 108ms 337µs 333ns │ │ max │ 166ms 467µs 458ns │ │ stddev │ 6ms 175µs 618ns │ ╰────────┴───────────────────╯ ❯ use std bench ❯ bench { polars open Data7602DescendingYearOrder.csv \| polars group-by year \| polars agg (polars col geo_count \| polars sum) \| polars collect \| null } -n 100 ╭───────┬───────────────────╮ │ mean │ 114ms 363µs 530ns │ │ min │ 108ms 804µs 833ns │ │ max │ 143ms 521µs 459ns │ │ std │ 5ms 88µs 56ns │ │ times │ [list 100 items] │ ╰───────┴───────────────────╯ ``` ## After (using parquet + mimalloc) ```nushell ❯ 1..100 \| each {timeit {polars open data.parquet \| polars group-by year \| polars agg (polars col geo_count \| polars sum) \| polars collect \| null}} \| \| {mean: ($in \| math avg), min: ($in \| math min), max: ($in \| math max), stddev: ($in \| into int \| into float \| math stddev \| into int \| $'($in)ns' \| into duration)} ╭────────┬──────────────────╮ │ mean │ 34ms 255µs 492ns │ │ min │ 31ms 787µs 250ns │ │ max │ 76ms 408µs 416ns │ │ stddev │ 4ms 472µs 916ns │ 
╰────────┴──────────────────╯ ❯ use std bench ❯ bench { polars open data.parquet \| polars group-by year \| polars agg (polars col geo_count \| polars sum) \| polars collect \| null } -n 100 ╭───────┬──────────────────╮ │ mean │ 34ms 897µs 562ns │ │ min │ 31ms 518µs 542ns │ │ max │ 65ms 943µs 625ns │ │ std │ 3ms 450µs 741ns │ │ times │ [list 100 items] │ ╰───────┴──────────────────╯ ```	2024-05-25 09:10:01 -05:00
Ian Manske	95977faf2d	Do not propagate glob creation error for external args (#12955 ) # Description Instead of returning an error, this PR changes `expand_glob` in `run_external.rs` to return the original string arg if glob creation failed. This makes it so that, e.g., ```nushell ^echo `[` ^echo `***` ``` no longer fail with a shell error. (This follows from #12921.)	2024-05-25 08:59:36 +08:00
Ian Manske	c5d716951f	Allow byte streams with unknown type to be compatible with binary (#12959 ) # Description Currently, the pipeline `open --raw file \| take 100` doesn't work, since the type of the byte stream is `Unknown`, but `take` expects `Binary` streams. This PR changes commands that expect `ByteStreamType::Binary` to also work with `ByteStreamType::Unknown`. This was done by adding two new methods to `ByteStreamType`: `is_binary_coercible` and `is_string_coercible`. These return true if the type is `Unknown` or matches the type in the method name.	2024-05-24 17:54:38 -07:00
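The coercibility check described in #12959 can be sketched as follows. This is a minimal stand-in, not the actual `nu-protocol` code: the enum is redeclared locally, and the receiver type (`self` by value) and the `Copy` derive are assumptions.

```rust
/// Simplified local stand-in for the `ByteStreamType` mentioned in the PR.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ByteStreamType {
    Binary,
    String,
    Unknown,
}

impl ByteStreamType {
    /// True if the stream is known to be binary, or if its type is unknown
    /// and may therefore be treated as binary.
    pub fn is_binary_coercible(self) -> bool {
        matches!(self, ByteStreamType::Binary | ByteStreamType::Unknown)
    }

    /// True if the stream is known to be a string, or if its type is unknown
    /// and may therefore be treated as a string.
    pub fn is_string_coercible(self) -> bool {
        matches!(self, ByteStreamType::String | ByteStreamType::Unknown)
    }
}
```

With this shape, a command such as `take` would accept a stream when `is_binary_coercible()` returns true instead of requiring an exact `Binary` match.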
Devyn Cairns	b06f31d3c6	Make `from json --objects` streaming (#12949 ) # Description Makes the `from json --objects` command produce a stream, and read lazily from an input stream to produce its output. Also added a helper, `PipelineData::get_type()`, to make it easier to construct a wrong type error message when matching on `PipelineData`. I expect checking `PipelineData` for either a string value or an `Unknown` or `String` typed `ByteStream` will be very, very common. I would have liked to have a helper that just returns a readable stream from either, but that would either be a bespoke enum or a `Box<dyn BufRead>`, which feels like it wouldn't be so great for performance. So instead, taking the approach I did here is probably better - having a function that accepts the `impl BufRead` and matching to use it. # User-Facing Changes - `from json --objects` no longer collects its input, and can be used for large datasets or streams that produce values over time. # Tests + Formatting All passing. # After Submitting - [ ] release notes --------- Co-authored-by: Ian Manske <ian.manske@pm.me>	2024-05-24 23:37:50 +00:00
Ian Manske	84b7a99adf	Revert "Polars lazy refactor (#12669 )" (#12962 ) This reverts commit `68adc4657f`. # Description Reverts the lazyframe refactor (#12669) for the next release, since there are still a few lingering issues. This temporarily solves #12863 and #12828. After the release, the lazyframes can be added back and cleaned up.	2024-05-24 18:09:26 -05:00
Darren Schroeder	7d11c28eea	Revert "Remove `std::env::set_current_dir()` call from `EngineState::merge_env()`" (#12954 ) Reverts nushell/nushell#12922	2024-05-24 11:09:59 -05:00
Ian Manske	bf07806b1b	Use `cwd` in `grid` (#12947 ) # Description Fixes #12946. The `grid` command does not use the cwd when trying to get the icon or color for a file/path.	2024-05-23 20:38:47 +00:00
Reilly Wood	0b5a4c0d95	`explore` refactoring+clarification (#12940 ) Another very boring PR cleaning up and documenting some of `explore`'s innards. Mostly renaming things that I found confusing or vague when reading through the code, also adding some comments.	2024-05-23 08:51:39 -05:00
Wind	f53aa6fcbf	fix std help (#12943 ) # Description Fixes: #12941 ~~The issue is caused by some columns (is_builtin, is_plugin, is_custom, is_keyword) being removed in #10023~~ Edit: I'm wrong # Tests + Formatting Added one test for `std help`	2024-05-23 08:51:02 -05:00
Ian Manske	2612a167e3	Remove list support in `with-env` (#12939 ) # Description Following from #12523, this PR removes support for lists of environment variables in the `with-env` command. Rather, only records will be supported now. # After Submitting Update examples using the list form in the docs and book.	2024-05-23 13:53:55 +08:00
Reilly Wood	c7097ca937	`explore` cleanup: remove+move binary viewer config (#12920 ) Small change, removing 4 more configuration options from `explore`'s binary viewer: 1. `show_index` 2. `show_data` 3. `show_ascii` 4. `show_split` These controlled whether the 3 columns in the binary viewer (index, hex data, ASCII) and the pipe separator (`\|`) in between them are shown. I don't think we need this level of configurability until the `explore` command is more mature, and maybe even not then; we can just show them all. I think it's very unlikely that anyone is using these configuration points. Also, the row offset (e.g. how many rows we have scrolled down) was being stored in config/settings when it's arguably not config; more like internal state of the binary viewer. I moved it to a more appropriate location and renamed it.	2024-05-22 20:06:14 -07:00
Wind	58cf0c56f8	add some completion tests (#12908 ) # Description ```nushell ❯ ls ╭───┬───────┬──────┬──────┬──────────╮ │ # │ name │ type │ size │ modified │ ├───┼───────┼──────┼──────┼──────────┤ │ 0 │ a.txt │ file │ 0 B │ now │ ╰───┴───────┴──────┴──────┴──────────╯ ❯ ls a. NO RECORDS FOUND ``` There was a completion issue in a previous version, which I think @amtoine has reproduced before, but I currently can't reproduce it on the latest main. To guard against such a regression, I added some tests for completion. --------- Co-authored-by: Antoine Stevan <44101798+amtoine@users.noreply.github.com>	2024-05-23 10:47:06 +08:00
YizhePKU	6c649809d3	Rewrite run_external.rs (#12921 ) This PR is a complete rewrite of `run_external.rs`. The main goal of the rewrite is improving readability, but it also fixes some bugs related to argument handling and the PATH variable (fixes https://github.com/nushell/nushell/issues/6011). I'll discuss some technical details to make reviewing easier. ## Argument handling Quoting arguments for external commands is hard. Like, really hard. We've had more than a dozen issues and PRs dedicated to quoting arguments (see Appendix) but the current implementation is still buggy. Here's a demonstration of the buggy behavior: ```nu let foo = "'bar'" ^touch $foo # This creates a file named `bar`, but it should be `'bar'` ^touch ...[ "'bar'" ] # Same ``` I'll describe how this PR deals with argument handling. First, we'll introduce the concept of bare strings. Bare strings are string literals that are either unquoted or quoted by backticks [^1]. Strings within a list literal are NOT considered bare strings, even if they are unquoted or quoted by backticks. When a bare string is used as an argument to external process, we need to perform tilde-expansion, glob-expansion, and inner-quotes-removal, in that order. "Inner-quotes-removal" means transforming from `--option="value"` into `--option=value`. ## `.bat` files and CMD built-ins On Windows, `.bat` files and `.cmd` files are considered executable, but they need `CMD.exe` as the interpreter. The Rust standard library supports running `.bat` files directly and will spawn `CMD.exe` under the hood (see [documentation](https://doc.rust-lang.org/std/process/index.html#windows-argument-splitting)). However, other extensions are not supported [^2]. Nushell also supports a selected number of CMD built-ins. The problem with CMD is that it uses a different set of quoting rules. 
Correctly quoting for CMD requires using [Command::raw_arg()](https://doc.rust-lang.org/std/os/windows/process/trait.CommandExt.html#tymethod.raw_arg) and manually quoting CMD special characters, on top of quoting from the Nushell side. ~~I decided that this is too complex and chose to reject special characters in CMD built-ins instead [^3]. Hopefully this will not affect real-world use cases.~~ I've implemented escaping that works reasonably well. ## `which-support` feature The `which` crate is now a hard dependency of `nu-command`, making the `which-support` feature essentially useless. The `which` crate is already a hard dependency of `nu-cli`, and we should consider removing the `which-support` feature entirely. ## Appendix Here's a list of quoting-related issues and PRs in rough chronological order. * https://github.com/nushell/nushell/issues/4609 * https://github.com/nushell/nushell/issues/4631 * https://github.com/nushell/nushell/issues/4601 * https://github.com/nushell/nushell/pull/5846 * https://github.com/nushell/nushell/issues/5978 * https://github.com/nushell/nushell/pull/6014 * https://github.com/nushell/nushell/issues/6154 * https://github.com/nushell/nushell/pull/6161 * https://github.com/nushell/nushell/issues/6399 * https://github.com/nushell/nushell/pull/6420 * https://github.com/nushell/nushell/pull/6426 * https://github.com/nushell/nushell/issues/6465 * https://github.com/nushell/nushell/issues/6559 * https://github.com/nushell/nushell/pull/6560 [^1]: The idea that backtick-quoted strings act like bare strings was introduced by Kubouch and briefly mentioned in [the language reference](https://www.nushell.sh/lang-guide/chapters/strings_and_text.html#backtick-quotes). [^2]: The documentation also said "running .bat scripts in this way may be removed in the future and so should not be relied upon", which is another reason to move away from this. But again, quoting for CMD is hard. 
[^3]: If anyone wants to try, the best resource I found on the topic is [this](https://daviddeley.com/autohotkey/parameters/parameters.htm).	2024-05-23 02:05:27 +00:00
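The "inner-quotes-removal" step that #12921 describes (turning `--option="value"` into `--option=value`) can be illustrated with a small sketch. This is a simplified assumption about the behavior, not the code from `run_external.rs`: the function names are hypothetical, and only one pair of matching outer quotes around the value after the first `=` is stripped.

```rust
/// Strip one pair of matching outer quotes (", ' or `) if present.
fn strip_outer_quotes(s: &str) -> &str {
    for q in ['"', '\'', '`'] {
        if s.len() >= 2 && s.starts_with(q) && s.ends_with(q) {
            // Quote characters are one byte each, so these are valid boundaries.
            return &s[1..s.len() - 1];
        }
    }
    s
}

/// Hypothetical helper: rewrite `--option="value"` as `--option=value`.
/// Arguments without `=`, or with quotes in the option part, are left alone.
pub fn remove_inner_quotes(arg: &str) -> String {
    match arg.split_once('=') {
        Some((option, value)) if !option.contains(['"', '\'', '`']) => {
            format!("{option}={}", strip_outer_quotes(value))
        }
        _ => arg.to_string(),
    }
}
```

Per the PR description, this step would apply only to bare strings, after tilde-expansion and glob-expansion; a value such as `'bar'` that arrives through a variable or list would be passed through unchanged.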
Jakub Žádník	64afb52ffa	Fix leftover wrong column name (#12937 ) # Description Small fixup for https://github.com/nushell/nushell/pull/12930	2024-05-22 21:24:22 +00:00
Wind	ac4125f8ed	fix range semantic in detect_columns, str substring, str index-of (#12894 ) # Description Fixes: https://github.com/nushell/nushell/issues/7761 It's still undecided whether we want to change the range semantics themselves, but it's good to keep range semantics consistent between nushell commands. # User-Facing Changes ### Before ```nushell ❯ "abc" \| str substring 1..=2 b ``` ### After ```nushell ❯ "abc" \| str substring 1..=2 bc ``` # Tests + Formatting Adjusted tests to fit the new behavior	2024-05-22 20:00:58 +03:00
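The "After" behavior in #12894 treats `1..=2` as including both endpoints. A minimal sketch of that semantics in Rust, assuming character-based indexing for simplicity (the helper name is hypothetical and this is not nushell's implementation):

```rust
use std::ops::RangeInclusive;

/// Return the characters of `s` whose indices fall inside `range`,
/// with both the start and the end index included.
pub fn substring_inclusive(s: &str, range: RangeInclusive<usize>) -> String {
    let (start, end) = (*range.start(), *range.end());
    if start > end {
        return String::new();
    }
    // An inclusive range `start..=end` covers `end - start + 1` items.
    let count = (end - start).saturating_add(1);
    s.chars().skip(start).take(count).collect()
}
```

The "Before" output (`b`) corresponds to treating the end index as exclusive, which is the inconsistency the PR removes.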
YizhePKU	7ede90cba5	Remove `std::env::set_current_dir()` call from `EngineState::merge_env()` (#12922 ) As discussed in https://github.com/nushell/nushell/pull/12749, we no longer need to call `std::env::set_current_dir()` to sync `$env.PWD` with the actual working directory. This PR removes the call from `EngineState::merge_env()`.	2024-05-22 19:58:27 +03:00
Jakub Žádník	75689ec98a	Small improvements to `debug profile` (#12930 ) # Description 1. With the `-l` flag, `debug profile` now collects files and line numbers of profiled pipeline elements ![profiler_lines](https://github.com/nushell/nushell/assets/25571562/b400a956-d958-4aff-aa4c-7e65da3f78fa) 2. Error from the profiled closure will be reported instead of silently ignored. ![profiler_lines_error](https://github.com/nushell/nushell/assets/25571562/54f7ad7a-06a3-4d56-92c2-c3466917bee8) # User-Facing Changes New `--lines(-l)` flag to `debug profile`. The command will also fail if the profiled closure fails, so technically it is a breaking change. --------- Co-authored-by: Ian Manske <ian.manske@pm.me>	2024-05-22 19:56:51 +03:00
Devyn Cairns	7de513a4e0	Implement streaming I/O for CSV and TSV commands (#12918 ) # Description Implements streaming for: - `from csv` - `from tsv` - `to csv` - `to tsv` via the new string-typed ByteStream support. # User-Facing Changes Commands above. Also: - `to csv` and `to tsv` now have `--columns <List(String)>`, to provide the exact columns desired in the output. This is required for them to have streaming output, because otherwise collecting the entire list is necessary to determine the output columns. If we introduce `TableStream`, this may become less necessary. # Tests + Formatting - 🟢 `toolkit fmt` - 🟢 `toolkit clippy` - 🟢 `toolkit test` - 🟢 `toolkit test stdlib` # After Submitting - [ ] release notes --------- Co-authored-by: Ian Manske <ian.manske@pm.me>	2024-05-22 16:55:24 +00:00
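The reason #12918 gives for `--columns` (the header must be known before the first row is written, otherwise the whole input has to be collected) can be sketched like this. The function name and the row representation (`HashMap<String, String>`) are assumptions for illustration, and the sketch does no CSV quoting or escaping.

```rust
use std::collections::HashMap;
use std::io::{self, Write};

/// Write rows as CSV one at a time. Because `columns` is supplied up front,
/// the header can be emitted immediately and no row needs to be buffered.
pub fn write_csv_streaming<W, I>(mut out: W, columns: &[&str], rows: I) -> io::Result<()>
where
    W: Write,
    I: IntoIterator<Item = HashMap<String, String>>,
{
    writeln!(out, "{}", columns.join(","))?;
    for row in rows {
        // Missing columns become empty fields; extra keys are ignored.
        let fields: Vec<&str> = columns
            .iter()
            .map(|c| row.get(*c).map(String::as_str).unwrap_or(""))
            .collect();
        writeln!(out, "{}", fields.join(","))?;
    }
    Ok(())
}
```

Without the explicit column list, the writer would have to scan every row to discover the union of keys before emitting the header, which is exactly the collecting behavior the PR avoids.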
Devyn Cairns	758c5d447a	Add support for the `ps` command on FreeBSD, NetBSD, and OpenBSD (#12892 ) # Description I feel like it's a little sad that BSDs get to enjoy almost everything other than the `ps` command, and there are some tests that rely on this command, so I figured it would be fun to patch that and make it work. The different BSDs have diverged from each other somewhat, but generally have a similar enough API for reading process information via `sysctl()`, with some slightly different args. This supports FreeBSD with the `freebsd` module, and NetBSD and OpenBSD with the `netbsd` module. OpenBSD is a fork of NetBSD and the interface has some minor differences but many things are the same. I had wanted to try to support DragonFlyBSD too, but their Rust version in the latest release is only 1.72.0, which is too old for me to want to try to compile rustc up to 1.77.2... but I will revisit this whenever they do update it. Dragonfly is a fork of FreeBSD, so it's likely to be more or less the same - I just don't want to enable it without testing it. Fixes #6862 (partially, we probably won't be adding `zfs list`) # User-Facing Changes `ps` added for FreeBSD, NetBSD, and OpenBSD. # Tests + Formatting The CI doesn't run tests for BSDs, so I'm not entirely sure if everything was already passing before. (Frankly, it's unlikely.) But nothing appears to be broken. # After Submitting - [ ] release notes? - [ ] DragonflyBSD, whenever they do update Rust to something close enough for me to try it	2024-05-22 08:13:45 -07:00
dependabot[bot]	d7e75c0b70	Bump shadow-rs from 0.27.1 to 0.28.0 (#12932 ) Bumps [shadow-rs](https://github.com/baoyachi/shadow-rs) from 0.27.1 to 0.28.0. Release notes (sourced from [shadow-rs's releases](https://github.com/baoyachi/shadow-rs/releases)): fix cargo clippy ([#160](https://redirect.github.com/baoyachi/shadow-rs/issues/160)), thanks to [@qartik](https://github.com/qartik). Commits: - `ba9f8b0` Update Cargo.toml - `d1b724c` Merge pull request [#160](https://redirect.github.com/baoyachi/shadow-rs/issues/160) from qartik/patch-1 - `505108d` Allow missing_docs for deprecated CLAP_VERSION constant - See full diff in [compare view](https://github.com/baoyachi/shadow-rs/compare/v0.27.1...v0.28.0) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-22 15:59:33 +08:00
NotTheDr01ds	f83439fdda	Add completer for std help (#12929 ) # Description While each of the `help <subcommands>` in `std` had completers, there wasn't one for the main `help` command. This adds all internals and custom commands (as with `help commands`) as possible completions. # User-Facing Changes `help ` + <kbd>Tab</kbd> will now suggest completions for both the `help <subcommands>` as well as all internal and custom commands. # Tests + Formatting Note: Cannot add tests for completion functions since they are module-internal and not visible to test cases, that I can see. - 🟢 `toolkit fmt` - 🟢 `toolkit clippy` - 🟢 `toolkit test` - 🟢 `toolkit test stdlib`	2024-05-21 10:31:14 -05:00
Stefan Holderbach	db37bead64	Remove unused dependencies (#12917 ) - Remove unused `pathdiff` dep in `nu-cli` - Remove unused `serde_json` dep on `nu-protocol` - Unnecessary after moving the plugin file to msgpack (still a dev-dependency)	2024-05-21 01:09:28 +00:00
Reilly Wood	6e050f5634	`explore`: consolidate padding config, handle ByteStream, tweak naming+comments (#12915 ) Some minor changes to `explore`, continuing on my mission to simplify the command in preparation for a larger UX overhaul: 1. Consolidate padding configuration. I don't think we need separate config points for the (optional) index column and regular data columns in the normal pager, they can share padding configuration. Likewise, in the binary viewer all 3 columns (index, data, ASCII) had their left+right padding configured independently. 2. Update `explore` so we use the binary viewer for the new `ByteStream` type. `cat foo.txt \| into binary \| explore` was not using the binary viewer after the `ByteStream` changes. 3. Tweak the naming of a few helper functions, add a comment I've put the changes in separate commits to make them easier to review. --------- Co-authored-by: Stefan Holderbach <sholderbach@users.noreply.github.com>	2024-05-20 22:03:21 +02:00
Ian Manske	905e3d0715	Remove dataframes crate and feature (#12889 ) # Description Removes the old `nu-cmd-dataframe` crate in favor of the polars plugin. As such, this PR also removes the `dataframe` feature, related CI, and full releases of nushell.	2024-05-20 17:22:08 +00:00
Darren Schroeder	4f69ba172e	add `math min` and `math max` to `bench` command (#12913 ) # Description This PR adds min and max to the bench command. ```nushell ❯ use std bench ❯ bench { dply -c 'parquet("./data.parquet") \| group_by(year) \| summarize(count = n(), sum = sum(geo_count)) \| show()' \| complete \| null } --rounds 100 --verbose 100 / 100 ╭───────┬───────────────────╮ │ mean │ 71ms 358µs 850ns │ │ min │ 66ms 457µs 583ns │ │ max │ 120ms 338µs 167ns │ │ std │ 6ms 553µs 949ns │ │ times │ [list 100 items] │ ╰───────┴───────────────────╯ ```	2024-05-20 10:08:03 -05:00
Ian Manske	c98960d053	Take owned `Read` and `Write` (#12909 ) # Description As @YizhePKU pointed out, the [Rust API guidelines](https://rust-lang.github.io/api-guidelines/interoperability.html#generic-readerwriter-functions-take-r-read-and-w-write-by-value-c-rw-value) recommend that generic functions take readers and writers by value and not by reference. This PR changes `copy_with_interupt` and a few other places to take owned `Read` and `Write` instead of mutable references.	2024-05-20 15:10:36 +02:00
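The guideline referenced in #12909 can be shown with a small generic function. This is a sketch with a hypothetical name (`copy_all`), not the nushell function itself. Taking `R` and `W` by value loses no flexibility, because the standard library implements `Read` for `&mut R` and `Write` for `&mut W`.

```rust
use std::io::{self, Read, Write};

/// Copy everything from `reader` to `writer`, taking both by value.
/// Returns the number of bytes copied.
pub fn copy_all<R: Read, W: Write>(mut reader: R, mut writer: W) -> io::Result<u64> {
    io::copy(&mut reader, &mut writer)
}
```

A caller that wants to keep using its reader or writer afterwards passes `&mut reader` or `&mut writer`; a caller that is done with them can hand over ownership directly.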
Devyn Cairns	c61075e20e	Add string/binary type color to `ByteStream` (#12897 ) # Description This PR allows byte streams to optionally be colored as being specifically binary or string data, which guarantees that they'll be converted to `Binary` or `String` appropriately on `into_value()`, making them compatible with `Type` guarantees. This makes them significantly more broadly usable for command input and output. There is still an `Unknown` type for byte streams coming from external commands, which uses the same behavior as we previously did where it's a string if it's UTF-8. A small number of commands were updated to take advantage of this, just to prove the point. I will be adding more after this merges. # User-Facing Changes - New types in `describe`: `string (stream)`, `binary (stream)` - These commands now return a stream if their input was a stream: - `into binary` - `into string` - `bytes collect` - `str join` - `first` (binary) - `last` (binary) - `take` (binary) - `skip` (binary) - Streams that are explicitly binary colored will print as a streaming hexdump - example: ```nushell 1.. \| each { into binary } \| bytes collect ``` # Tests + Formatting I've added some tests to cover it at a basic level, and it doesn't break anything existing, but I do think more would be nice. Some of those will come when I modify more commands to stream. # After Submitting There are a few things I'm not quite satisfied with: - String trimming behavior. We automatically trim newlines from streams from external commands, but I don't think we should do this with internal commands. If I call a command that happens to turn my string into a stream, I don't want the newline to suddenly disappear. I changed this to specifically do it only on `Child` and `File`, but I don't know if this is quite right, and maybe we should bring back the old flag for `trim_end_newline` - Known binary always resulting in a hexdump. 
It would be nice to have a `print --raw`, so that we can put binary data on stdout explicitly if we want to. This PR doesn't change how external commands work though - they still dump straight to stdout. Otherwise, here's the normal checklist: - [ ] release notes - [ ] docs update for plugin protocol changes (added `type` field) --------- Co-authored-by: Ian Manske <ian.manske@pm.me>	2024-05-20 00:35:32 +00:00
Ian Manske	baeba19b22	Make `get_full_help` take `&dyn Command` (#12903 ) # Description Changes `get_full_help` to take a `&dyn Command` instead of multiple arguments (`&Signature`, `&Examples` `is_parser_keyword`). All of these arguments can be gathered from a `Command`, so there is no need to pass the pieces to `get_full_help`. This PR also fixes an issue where the search terms are not shown if `--help` is used on a command.	2024-05-19 19:56:33 +02:00
Ian Manske	474293bf1c	Clear environment for child `Command`s (#12901 ) # Description There is a bug when `hide-env` is used on environment variables that were present at shell startup. Namely, child processes still inherit the hidden environment variable. This PR fixes #12900, fixes #11495, and fixes #7937. # Tests + Formatting Added a test.	2024-05-19 15:35:07 +00:00
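The title of #12901 points at the standard-library mechanism for this: clear the inherited environment on the child `Command`, then set only the variables the shell still considers visible. A minimal sketch, assuming a hypothetical helper name and a plain `HashMap` as the shell's view of the environment (the actual nushell code may differ):

```rust
use std::collections::HashMap;
use std::process::Command;

/// Build a `Command` whose environment is exactly `visible_env`.
/// Variables present at shell startup but later hidden are not inherited,
/// because `env_clear` drops everything before `envs` re-adds the rest.
pub fn command_with_env(program: &str, visible_env: &HashMap<String, String>) -> Command {
    let mut cmd = Command::new(program);
    cmd.env_clear();
    cmd.envs(visible_env);
    cmd
}
```

Without the `env_clear` call, a child process inherits the parent's process environment by default, so a variable removed only from the shell's own table would still reach it.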
Ian Manske	cc9f41e553	Use `CommandType` in more places (#12832 ) # Description Kind of a vague title, but this PR does two main things: 1. Rather than overriding functions like `Command::is_parser_keyword`, this PR instead changes commands to override `Command::command_type`. The `CommandType` returned by `Command::command_type` is then used to automatically determine whether `Command::is_parser_keyword` and the other `is_{type}` functions should return true. These changes allow us to remove the `CommandType::Other` case and should also guarantee that only one of the `is_{type}` functions on `Command` will return true. 2. Uses the new, reworked `Command::command_type` function in the `scope commands` and `which` commands. # User-Facing Changes - Breaking change for `scope commands`: multiple columns (`is_builtin`, `is_keyword`, `is_plugin`, etc.) have been merged into the `type` column. - Breaking change: the `which` command can now report `plugin` or `keyword` instead of `built-in` in the `type` column. It may also now report `external` instead of `custom` in the `type` column for known `extern`s.	2024-05-18 23:37:31 +00:00
Ian Manske	580c60bb82	Preserve metadata in more places (#12848 ) # Description This PR makes some commands and areas of code preserve pipeline metadata. This is in an attempt to make the issue described in #12599 and #9456 less likely to occur. That is, reading and writing to the same file in a pipeline will result in an empty file. Since we preserve metadata in more places now, there will be a higher chance that we successfully detect this error case and abort the pipeline.	2024-05-17 17:59:32 +00:00
Devyn Cairns	c10aa2cf09	`collect`: don't require a closure (#12788 ) # Description This changes the `collect` command so that it doesn't require a closure. Still allowed, optionally. Before: ```nushell open foo.json \| insert foo bar \| collect { save -f foo.json } ``` After: ```nushell open foo.json \| insert foo bar \| collect \| save -f foo.json ``` The closure argument isn't really necessary, as collect values are also supported as `PipelineData`. # User-Facing Changes - `collect` command changed # Tests + Formatting Example changed to reflect. # After Submitting - [ ] release notes - [ ] we may want to deprecate the closure arg?	2024-05-17 18:46:03 +02:00
Devyn Cairns	e3db6ea04a	Exclude polars from ensure_plugins_built(), for performance reasons (#12896 ) # Description We have been building `nu_plugin_polars` unnecessarily during `cargo test`, which is very slow. All of its tests are run within its own crate, which happens during the plugins CI phase. This should speed up the CI a bit.	2024-05-17 15:04:59 +00:00
Devyn Cairns	59f7c523fa	Fix the way the output of `table` is printed in `print()` (#12895 ) # Description Forgot that I fixed this already on my branch, but when printing without a display output hook, the implicit call to `table` gets its output mangled with newlines (since #12774). This happens when running `nu -c` or a script file. Here's that fix in one PR so it can be merged easily. # Tests + Formatting - 🟢 `toolkit fmt` - 🟢 `toolkit clippy` - 🟢 `toolkit test` - 🟢 `toolkit test stdlib`	2024-05-17 07:18:18 -07:00
Wind	8adf3406e5	allow define it as a variable inside closure (#12888 ) # Description Fixes: #12690 The issue appeared after https://github.com/nushell/nushell/pull/12056 was merged. That PR raises an error if the user doesn't supply a required parameter when running a closure with `do`, and the parser adds a `$it` parameter when parsing a closure or block expression. I believe the previous behavior exists because we allowed this syntax in an earlier version (0.44): ```nushell let x = { print $it } ``` But it has not been allowed since 0.60, so I think the implicit `$it` parameter can be removed. # User-Facing Changes ```nushell let tmp = { let it = 42; print $it }; do -c $tmp ``` should be possible again. # Tests + Formatting Added 1 test	2024-05-17 00:03:13 +00:00
Ian Manske	6891267b53	Support `ByteStream`s in `bytes starts-with` and `bytes ends-with` (#12887 ) # Description Restores `bytes starts-with` so that it is able to work with byte streams once again. For parity/consistency, this PR also adds byte stream support to `bytes ends-with`. # User-Facing Changes - `bytes ends-with` now supports byte streams. # Tests + Formatting Re-enabled tests for `bytes starts-with` and added tests for `bytes ends-with`.	2024-05-17 07:59:08 +08:00
Ian Manske	aec41f3df0	Add `Span` merging functions (#12511 ) # Description This PR adds a few functions to `Span` for merging spans together: - `Span::append`: merges two spans that are known to be in order. - `Span::concat`: returns a span that encompasses all the spans in a slice. The spans must be in order. - `Span::merge`: merges two spans (no order necessary). - `Span::merge_many`: merges an iterator of spans into a single span (no order necessary). These are meant to replace the free-standing `nu_protocol::span` function. The spans in a `LiteCommand` (the `parts`) should always be in order based on the lite parser and lexer. So, the parser code sees the most usage of `Span::append` and `Span::concat` where the order is known. In other code areas, `Span::merge` and `Span::merge_many` are used since the order between spans is often not known.	2024-05-16 22:34:49 +00:00
Ian Manske	2a09dccc11	`Bytestream` touchup (#12886 ) # Description Adds some docs and a small fix to `Chunks`.	2024-05-16 21:15:20 +00:00
Ian Manske	6fd854ed9f	Replace `ExternalStream` with new `ByteStream` type (#12774 ) # Description This PR introduces a `ByteStream` type which is a `Read`-able stream of bytes. Internally, it has an enum over three different byte stream sources: ```rust pub enum ByteStreamSource { Read(Box<dyn Read + Send + 'static>), File(File), Child(ChildProcess), } ``` This is in comparison to the current `RawStream` type, which is an `Iterator<Item = Vec<u8>>` and has to allocate for each read chunk. Currently, `PipelineData::ExternalStream` serves a weird dual role where it is either external command output or a wrapper around `RawStream`. `ByteStream` makes this distinction more clear (via `ByteStreamSource`) and replaces `PipelineData::ExternalStream` in this PR: ```rust pub enum PipelineData { Empty, Value(Value, Option<PipelineMetadata>), ListStream(ListStream, Option<PipelineMetadata>), ByteStream(ByteStream, Option<PipelineMetadata>), } ``` The PR is relatively large, but a decent amount of it is just repetitive changes. This PR fixes #7017, fixes #10763, and fixes #12369. This PR also improves performance when piping external commands. Nushell should, in most cases, have competitive pipeline throughput compared to, e.g., bash. \| Command \| Before (MB/s) \| After (MB/s) \| Bash (MB/s) \| \| -------------------------------------------------- \| -------------:\| ------------:\| -----------:\| \| `throughput \\| rg 'x'` \| 3059 \| 3744 \| 3739 \| \| `throughput \\| nu --testbin relay o> /dev/null` \| 3508 \| 8087 \| 8136 \| # User-Facing Changes - This is a breaking change for the plugin communication protocol, because the `ExternalStreamInfo` was replaced with `ByteStreamInfo`. Plugins now only have to deal with a single input stream, as opposed to the previous three streams: stdout, stderr, and exit code. - The output of `describe` has been changed for external/byte streams. - Temporary breaking change: `bytes starts-with` no longer works with byte streams. 
This is to keep the PR smaller, and `bytes ends-with` already does not work on byte streams. - If a process core dumped, then instead of having a `Value::Error` in the `exit_code` column of the output returned from `complete`, it now is a `Value::Int` with the negation of the signal number. # After Submitting - Update docs and book as necessary - Release notes (e.g., plugin protocol changes) - Adapt/convert commands to work with byte streams (high priority is `str length`, `bytes starts-with`, and maybe `bytes ends-with`). - Refactor the `tee` code, Devyn has already done some work on this. --------- Co-authored-by: Devyn Cairns <devyn.cairns@gmail.com>	2024-05-16 07:11:18 -07:00
Wind	1b8eb23785	allow passing float value to custom command (#12879 ) # Description Fixes: #12691 `parse_short_flag` only special-cases `SyntaxShape::Int` and `SyntaxShape::Number` to allow a flag to be a number. This PR adds `SyntaxShape::Float` to allow a flag to be a float number. # User-Facing Changes This is possible after this PR: ```nushell def spam [val: float] { $val }; spam -1.4 ``` # Tests + Formatting Added 1 test	2024-05-16 10:50:29 +02:00
Ian Manske	e20113a0eb	Remove stack debug assert (#12861 ) # Description In order for `Stack::unwrap_unique` to work as intended, we currently manually track all references to the parent stack and ensure that they are cleared before calling `Stack::unwrap_unique` in the REPL. We also only call `Stack::unwrap_unique` after all code from the current REPL entry has finished executing. Since `Value`s cannot store `Stack` references, this should have worked in theory. However, we forgot to account for threads. `run-external` (and maybe the plugin writers) can spawn threads that clone the `Stack`, holding on to references of the parent stack. These threads are not waited on or joined, and so may finish after the eval has already returned. This PR removes the `Stack::unwrap_unique` function and associated debug assert that was [causing panics](https://gist.github.com/cablehead/f3d2608a1629e607c2d75290829354f7) like @cablehead found. # After Submitting Make values cheaper to clone as a more robust solution to the performance issues with cloning the stack. --------- Co-authored-by: Wind <WindSoilder@outlook.com>	2024-05-15 22:59:10 +00:00
Jack Wright	6f3dbc97bb	fixed syntax shape requirements for --quantiles option for polars summary (#12878 ) Fix for #12730 All of the code expected a list of floats, but the syntax shape expected a table. Resolved by changing the syntax shape to list of floats. cc: @maxim-uvarov	2024-05-15 16:55:07 -05:00
Ian Manske	06fe7d1e16	Remove usages of `Call::positional_nth` (#12871 ) # Description Following from #12867, this PR replaces usages of `Call::positional_nth` with existing spans. This removes several `expect`s from the code. Also remove unused `positional_nth_mut` and `positional_iter_mut`	2024-05-15 19:59:42 +02:00
NotTheDr01ds	b08135d877	Fixed small error in the help-examples for the get command (#12877 ) # Description Another small error in Help, this time for the `get` command example. # User-Facing Changes Help only	2024-05-15 19:49:08 +02:00
NotTheDr01ds	72b880662b	Fixed a nitpick usage-help error - closure v. block (#12876 ) # Description So minor, but had to be fixed sometime. `help each while` used the term "block" in the "usage", but the argument type is a closure. # User-Facing Changes help-only	2024-05-15 18:16:59 +02:00
Ian Manske	0cfbdc909e	Fix `sys` panic (#12846 ) # Description This should fix #10155 where the `sys` command can panic due to date math in certain cases / on certain systems. # User-Facing Changes The `boot_time` column now has a date value instead of a formatted date string. This is technically a breaking change.	2024-05-15 15:40:04 +08:00
Wind	155934f783	make better messages for incomplete string (#12868 ) # Description Fixes: #12795 The issue is caused by `ParseError::UnexpectedEof` pointing at an empty position, so no detailed message is displayed. To fix it, I adjust the start of the span to `span.end - 1`, which ensures that it never points to an empty position. I also reorder the unclosed-character checks that run after lexing an item, so that unclosed opening delimiters are now checked first. # User-Facing Changes After this PR, a detailed error message is shown for an incomplete string when running scripts. ## Before ``` ❯ nu -c "'ab" Error: nu::parser::unexpected_eof × Unexpected end of code. ╭─[source:1:4] 1 │ 'ab ╰──── > ./target/debug/nu -c "r#'ab" Error: nu::parser::unexpected_eof × Unexpected end of code. ╭─[source:1:6] 1 │ r#'ab ╰──── ``` ## After ``` > nu -c "'ab" Error: nu::parser::unexpected_eof × Unexpected end of code. ╭─[source:1:3] 1 │ 'ab · ┬ · ╰── expected closing ' ╰──── > ./target/debug/nu -c "r#'ab" Error: nu::parser::unexpected_eof × Unexpected end of code. ╭─[source:1:5] 1 │ r#'ab · ┬ · ╰── expected closing '# ╰──── ``` # Tests + Formatting Added some tests for incomplete string. --------- Co-authored-by: Ian Manske <ian.manske@pm.me>	2024-05-15 01:14:11 +00:00
Ian Manske	c3da44cbb7	Fix `char` panic (#12867 ) # Description The `char` command can panic due to a failed `expect`: `char --integer ...[77 78 79]` This PR fixes the panic for the `--integer` flag and also the `--unicode` flag. # After Submitting Check other commands and places where similar bugs can occur due to usages of `Call::positional_nth` and related methods.	2024-05-14 21:10:06 +00:00
NotTheDr01ds	aa46bc97b3	Search terms for compact command (#12864 ) # Description There was a question in Discord today about how to remove empty rows from a table. The user found the `compact` command on their own, but I realized that there were no search terms on the command. I've added 'empty' and 'remove', although I subsequently figured out that 'empty' is found in the "usage" anyway. That said, I don't think it hurts to have good search terms behind it regardless. # User-Facing Changes Just the help # Tests + Formatting - 🟢 `toolkit fmt` - 🟢 `toolkit clippy` - 🟢 `toolkit test` - 🟢 `toolkit test stdlib` # After Submitting	2024-05-14 09:21:50 -05:00
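
The `Span` merging functions described in the "Add `Span` merging functions" commit (#12511) above can be illustrated with a minimal sketch. This is not the `nu_protocol` implementation: the `Span` struct, its `new` constructor, and the `Option` return type of `merge_many` are assumptions made here so the example stands alone. Only the function names and their ordering requirements come from the commit message.

```rust
// Hypothetical stand-in for nu_protocol's `Span`; the real type may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Span {
    start: usize,
    end: usize,
}

impl Span {
    // Assumed constructor for this sketch.
    fn new(start: usize, end: usize) -> Self {
        assert!(start <= end, "a span must not end before it starts");
        Span { start, end }
    }

    /// Merges two spans that are known to be in order (`self` before `other`).
    fn append(self, other: Span) -> Span {
        debug_assert!(self.end <= other.start, "spans must be in order");
        Span::new(self.start, other.end)
    }

    /// Merges two spans; no order is required.
    fn merge(self, other: Span) -> Span {
        Span::new(self.start.min(other.start), self.end.max(other.end))
    }

    /// Merges an iterator of spans, in any order, into one covering span.
    /// Returning `None` for an empty iterator is a simplification made here.
    fn merge_many(spans: impl IntoIterator<Item = Span>) -> Option<Span> {
        spans.into_iter().reduce(Span::merge)
    }
}
```

In this sketch, `append` is the cheap choice when the caller already knows the order (as the commit says is true for the parts of a `LiteCommand`), while `merge` and `merge_many` compare both ends and so work when the order is unknown.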


5798 Commits