Add run-time type checking for command pipeline input (#14741)

<!--
if this PR closes one or more issues, you can automatically link the PR
with
them by using one of the [*linking
keywords*](https://docs.github.com/en/issues/tracking-your-work-with-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword),
e.g.
- this PR should close #xxxx
- fixes #xxxx

you can also mention related issues, PRs or discussions!
-->

# Description
<!--
Thank you for improving Nushell. Please, check our [contributing
guide](../CONTRIBUTING.md) and talk to the core team before making major
changes.

Description of your pull request goes here. **Provide examples and/or
screenshots** if your changes affect the user experience.
-->

This PR adds type checking of all command input types at run-time.
Generally, these errors should be caught by the parser, but sometimes we
can't know the type of a value at parse-time. The simplest example is
using the `echo` command, which has an output type of `any`, so
prefixing a literal with `echo` will bypass parse-time type checking.

Before this PR, each command has to individually check its input types.
This can result in scenarios where the input/output types don't match
the actual command behavior. This can cause valid usage with an
non-`any` type to become a parse-time error if a command is missing that
type in its pipeline input/output (`drop nth` and `history import` do
this before this PR). Alternatively, a command may not list a type in
its input/output types, but doesn't actually reject that type in its
code, which can have unintended side effects (`get` does this on an
empty pipeline input, and `sort` used to before #13154).

After this PR, the type of the pipeline input is checked to ensure it
matches one of the input types listed in the proceeding command's
input/output types. While each of the issues in the "before this PR"
section could be addressed with each command individually, this PR
solves this issue for _all_ commands.

**This will likely cause some breakage**, as some commands have
incorrect input/output types, and should be adjusted. Also, some scripts
may have erroneous usage of commands. In writing this PR, I discovered
that `toolkit.nu` was passing `null` values to `str join`, which doesn't
accept nothing types (if folks think it should, we can adjust it in this
PR or in a different PR). I found some issues in the standard library
and its tests. I also found that carapace's vendor script had an
incorrect chaining of `get -i`:

```nushell
let expanded_alias = (scope aliases | where name == $spans.0 | get -i 0 | get -i expansion)
```

Before this PR, if the `get -i 0` ever actually did evaluate to `null`,
the second `get` invocation would error since `get` doesn't operate on
`null` values. After this PR, this is immediately a run-time error,
alerting the user to the problematic code. As a side note, we'll need to
PR this fix (`get -i 0 | get -i expansion` -> `get -i 0.expansion`) to
carapace.

A notable exception to the type checking is commands with input type of
`nothing -> <type>`. In this case, any input type is allowed. This
allows piping values into the command without an error being thrown. For
example, `123 | echo $in` would be an error without this exception.
Additionally, custom types bypass type checking (I believe this also
happens during parsing, but not certain)

I added a `is_subtype` method to `Value` and `PipelineData`. It
functions slightly differently than `get_type().is_subtype()`, as noted
in the doccomments. Notably, it respects structural typing of lists and
tables. For example, the type of a value `[{a: 123} {a: 456, b: 789}]`
is a subtype of `table<a: int>`, whereas the type returned by
`Value::get_type` is a `list<any>`. Similarly, `PipelineData` has some
special handling for `ListStream`s and `ByteStream`s. The latter was
needed for this PR to work properly with external commands.

Here's some examples.

Before:
```nu
1..2 | drop nth 1
Error: nu::parser::input_type_mismatch

  × Command does not support range input.
   ╭─[entry #9:1:8]
 1 │ 1..2 | drop nth 1
   ·        ────┬───
   ·            ╰── command doesn't support range input
   ╰────

echo 1..2 | drop nth 1
# => ╭───┬───╮
# => │ 0 │ 1 │
# => ╰───┴───╯
```

After this PR, I've adjusted `drop nth`'s input/output types to accept
range input.

Before this PR, zip accepted any value despite not being listed in its
input/output types. This caused different behavior depending on if you
triggered a parse error or not:
```nushell
1 | zip [2]
# => Error: nu::parser::input_type_mismatch
# => 
# =>   × Command does not support int input.
# =>    ╭─[entry #3:1:5]
# =>  1 │ 1 | zip [2]
# =>    ·     ─┬─
# =>    ·      ╰── command doesn't support int input
# =>    ╰────
echo 1 | zip [2]
# => ╭───┬───────────╮
# => │ 0 │ ╭───┬───╮ │
# => │   │ │ 0 │ 1 │ │
# => │   │ │ 1 │ 2 │ │
# => │   │ ╰───┴───╯ │
# => ╰───┴───────────╯
```

After this PR, it works the same in both cases. For cases like this, if
we do decide we want `zip` or other commands to accept any input value,
then we should explicitly add that to the input types.
```nushell
1 | zip [2]
# => Error: nu::parser::input_type_mismatch
# => 
# =>   × Command does not support int input.
# =>    ╭─[entry #3:1:5]
# =>  1 │ 1 | zip [2]
# =>    ·     ─┬─
# =>    ·      ╰── command doesn't support int input
# =>    ╰────
echo 1 | zip [2]
# => Error: nu:🐚:only_supports_this_input_type
# => 
# =>   × Input type not supported.
# =>    ╭─[entry #14:2:6]
# =>  2 │ echo 1 | zip [2]
# =>    ·      ┬   ─┬─
# =>    ·      │    ╰── only list<any> and range input data is supported
# =>    ·      ╰── input type: int
# =>    ╰────
```

# User-Facing Changes
<!-- List of all changes that impact the user experience here. This
helps us keep track of breaking changes. -->

**Breaking change**: The type of a command's input is now checked
against the input/output types of that command at run-time. While these
errors should mostly be caught at parse-time, in cases where they can't
be detected at parse-time they will be caught at run-time instead. This
applies to both internal commands and custom commands.

Example function and corresponding parse-time error (same before and
after PR):
```nushell
def foo []: int -> nothing {
  print $"my cool int is ($in)"
}

1 | foo
# => my cool int is 1

"evil string" | foo
# => Error: nu::parser::input_type_mismatch
# => 
# =>   × Command does not support string input.
# =>    ╭─[entry #16:1:17]
# =>  1 │ "evil string" | foo
# =>    ·                 ─┬─
# =>    ·                  ╰── command doesn't support string input
# =>    ╰────
# => 
```

Before:
```nu
echo "evil string" | foo
# => my cool int is evil string
```

After:
```nu
echo "evil string" | foo
# => Error: nu:🐚:only_supports_this_input_type
# => 
# =>   × Input type not supported.
# =>    ╭─[entry #17:1:6]
# =>  1 │ echo "evil string" | foo
# =>    ·      ──────┬──────   ─┬─
# =>    ·            │          ╰── only int input data is supported
# =>    ·            ╰── input type: string
# =>    ╰────
```

Known affected internal commands which erroneously accepted any type:
* `str join`
* `zip`
* `reduce`

# Tests + Formatting
<!--
Don't forget to add tests that cover your changes.

Make sure you've run and fixed any issues with these commands:

- `cargo fmt --all -- --check` to check standard code formatting (`cargo
fmt --all` applies these changes)
- `cargo clippy --workspace -- -D warnings -D clippy::unwrap_used` to
check that you're using the standard code style
- `cargo test --workspace` to check that all tests pass (on Windows make
sure to [enable developer
mode](https://learn.microsoft.com/en-us/windows/apps/get-started/developer-mode-features-and-debugging))
- `cargo run -- -c "use toolkit.nu; toolkit test stdlib"` to run the
tests for the standard library

> **Note**
> from `nushell` you can also use the `toolkit` as follows
> ```bash
> use toolkit.nu # or use an `env_change` hook to activate it
automatically
> toolkit check pr
> ```
-->
- 🟢 `toolkit fmt`
- 🟢 `toolkit clippy`
- 🟢 `toolkit test`
- 🟢 `toolkit test stdlib`


# After Submitting
<!-- If your PR had any user-facing changes, update [the
documentation](https://github.com/nushell/nushell.github.io) after the
PR is merged, if necessary. This will help us keep the docs up to date.
-->
* Play whack-a-mole with the commands and scripts this will inevitably
break
This commit is contained in:
132ikl 2025-01-08 17:09:47 -05:00 committed by GitHub
parent d894c8befe
commit 214714e0ab
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
25 changed files with 455 additions and 92 deletions

View File

@ -35,6 +35,7 @@ Note that history item IDs are ignored when importing from file."#
.category(Category::History)
.input_output_types(vec![
(Type::Nothing, Type::Nothing),
(Type::String, Type::Nothing),
(Type::List(Box::new(Type::String)), Type::Nothing),
(Type::table(), Type::Nothing),
])

View File

@ -13,6 +13,8 @@ impl Command for Rotate {
.input_output_types(vec![
(Type::record(), Type::table()),
(Type::table(), Type::table()),
(Type::list(Type::Any), Type::table()),
(Type::String, Type::table()),
])
.switch("ccw", "rotate counter clockwise", None)
.rest(
@ -21,6 +23,7 @@ impl Command for Rotate {
"the names to give columns once rotated",
)
.category(Category::Filters)
.allow_variants_without_examples(true)
}
fn description(&self) -> &str {

View File

@ -294,22 +294,13 @@ fn bind_args_to(
.expect("internal error: all custom parameters must have var_ids");
if let Some(result) = val_iter.next() {
let param_type = param.shape.to_type();
if required && !result.get_type().is_subtype(&param_type) {
// need to check if result is an empty list, and param_type is table or list
// nushell needs to pass type checking for the case.
let empty_list_matches = result
.as_list()
.map(|l| l.is_empty() && matches!(param_type, Type::List(_) | Type::Table(_)))
.unwrap_or(false);
if !empty_list_matches {
return Err(ShellError::CantConvert {
to_type: param.shape.to_type().to_string(),
from_type: result.get_type().to_string(),
span: result.span(),
help: None,
});
}
if required && !result.is_subtype_of(&param_type) {
return Err(ShellError::CantConvert {
to_type: param.shape.to_type().to_string(),
from_type: result.get_type().to_string(),
span: result.span(),
help: None,
});
}
stack.add_var(var_id, result);
} else if let Some(value) = &param.default_value {

View File

@ -19,18 +19,15 @@ pub fn check_example_input_and_output_types_match_command_signature(
// Skip tests that don't have results to compare to
if let Some(example_output) = example.result.as_ref() {
if let Some(example_input_type) =
if let Some(example_input) =
eval_pipeline_without_terminal_expression(example.example, cwd, engine_state)
{
let example_input_type = example_input_type.get_type();
let example_output_type = example_output.get_type();
let example_matches_signature =
signature_input_output_types
.iter()
.any(|(sig_in_type, sig_out_type)| {
example_input_type.is_subtype(sig_in_type)
&& example_output_type.is_subtype(sig_out_type)
example_input.is_subtype_of(sig_in_type)
&& example_output.is_subtype_of(sig_out_type)
&& {
witnessed_type_transformations
.insert((sig_in_type.clone(), sig_out_type.clone()));
@ -38,6 +35,9 @@ pub fn check_example_input_and_output_types_match_command_signature(
}
});
let example_input_type = example_input.get_type();
let example_output_type = example_output.get_type();
// The example type checks as a cell path operation if both:
// 1. The command is declared to operate on cell paths.
// 2. The example_input_type is list or record or table, and the example

View File

@ -38,6 +38,7 @@ impl Command for SubCommand {
(Type::Filesize, Type::String),
(Type::Date, Type::String),
(Type::Duration, Type::String),
(Type::Range, Type::String),
(
Type::List(Box::new(Type::Any)),
Type::List(Box::new(Type::String)),

View File

@ -13,10 +13,11 @@ impl Command for DropNth {
fn signature(&self) -> Signature {
Signature::build("drop nth")
.input_output_types(vec![(
Type::List(Box::new(Type::Any)),
Type::List(Box::new(Type::Any)),
)])
.input_output_types(vec![
(Type::Range, Type::list(Type::Number)),
(Type::list(Type::Any), Type::list(Type::Any)),
])
.allow_variants_without_examples(true)
.required(
"row number or row range",
// FIXME: we can make this accept either Int or Range when we can compose SyntaxShapes

View File

@ -30,6 +30,7 @@ If multiple cell paths are given, this will produce a list of values."#
),
(Type::table(), Type::Any),
(Type::record(), Type::Any),
(Type::Nothing, Type::Nothing),
])
.required(
"cell_path",
@ -163,6 +164,20 @@ fn action(
}
}
match input {
PipelineData::Empty => return Err(ShellError::PipelineEmpty { dst_span: span }),
// Allow chaining of get -i
PipelineData::Value(val @ Value::Nothing { .. }, ..) if !ignore_errors => {
return Err(ShellError::OnlySupportsThisInputType {
exp_input_type: "table or record".into(),
wrong_type: "nothing".into(),
dst_span: span,
src_span: val.span(),
})
}
_ => (),
}
if rest.is_empty() {
follow_cell_path_into_stream(input, signals, cell_path.members, span, !sensitive)
} else {

View File

@ -11,14 +11,7 @@ impl Command for Headers {
fn signature(&self) -> Signature {
Signature::build(self.name())
.input_output_types(vec![
(Type::table(), Type::table()),
(
// Tables with missing values are List<Any>
Type::List(Box::new(Type::Any)),
Type::table(),
),
])
.input_output_types(vec![(Type::table(), Type::table())])
.category(Category::Filters)
}

View File

@ -15,6 +15,7 @@ impl Command for Reject {
.input_output_types(vec![
(Type::record(), Type::record()),
(Type::table(), Type::table()),
(Type::list(Type::Any), Type::list(Type::Any)),
])
.switch(
"ignore-errors",
@ -161,6 +162,14 @@ impl Command for Reject {
Value::test_record(record! { "name" => Value::test_string("Cargo.lock") }),
])),
},
Example {
description: "Reject item in list",
example: "[1 2 3] | reject 1",
result: Some(Value::test_list(vec![
Value::test_int(1),
Value::test_int(3),
])),
},
]
}
}

View File

@ -13,7 +13,6 @@ impl Command for FromCsv {
Signature::build("from csv")
.input_output_types(vec![
(Type::String, Type::table()),
(Type::String, Type::list(Type::Any)),
])
.named(
"separator",

View File

@ -11,10 +11,7 @@ impl Command for FromTsv {
fn signature(&self) -> Signature {
Signature::build("from tsv")
.input_output_types(vec![
(Type::String, Type::table()),
(Type::String, Type::list(Type::Any)),
])
.input_output_types(vec![(Type::String, Type::table())])
.named(
"comment",
SyntaxShape::String,

View File

@ -14,6 +14,7 @@ impl Command for NuCheck {
fn signature(&self) -> Signature {
Signature::build("nu-check")
.input_output_types(vec![
(Type::Nothing, Type::Bool),
(Type::String, Type::Bool),
(Type::List(Box::new(Type::Any)), Type::Bool),
])

View File

@ -23,7 +23,7 @@ fn groups() {
#[test]
fn errors_if_given_unknown_column_name() {
let sample = r#"{
let sample = r#"[{
"nu": {
"committers": [
{"name": "Andrés N. Robalino"},
@ -41,7 +41,7 @@ fn errors_if_given_unknown_column_name() {
["punto", "cero"]
]
}
}
}]
"#;
let actual = nu!(pipeline(&format!(

View File

@ -114,7 +114,7 @@ fn error_reduce_fold_type_mismatch() {
fn error_reduce_empty() {
let actual = nu!(pipeline("reduce { |it, acc| $acc + $it }"));
assert!(actual.err.contains("needs input"));
assert!(actual.err.contains("no input value was piped in"));
}
#[test]

View File

@ -0,0 +1,61 @@
use nu_test_support::fs::Stub::EmptyFile;
use nu_test_support::playground::Playground;
use nu_test_support::{nu, pipeline};
#[test]
fn splits() {
let sample = r#"
[[first_name, last_name, rusty_at, type];
[Andrés, Robalino, "10/11/2013", A],
[JT, Turner, "10/12/2013", B],
[Yehuda, Katz, "10/11/2013", A]]
"#;
let actual = nu!(pipeline(&format!(
r#"
{sample}
| group-by rusty_at
| split-by type
| get A."10/11/2013"
| length
"#
)));
assert_eq!(actual.out, "2");
}
#[test]
fn errors_if_no_input() {
Playground::setup("split_by_no_input", |dirs, _sandbox| {
let actual = nu!(cwd: dirs.test(), pipeline("split-by type"));
assert!(actual.err.contains("no input value was piped in"));
})
}
#[test]
fn errors_if_non_record_input() {
Playground::setup("split_by_test_2", |dirs, sandbox| {
sandbox.with_files(&[
EmptyFile("los.txt"),
EmptyFile("tres.txt"),
EmptyFile("amigos.txt"),
EmptyFile("arepas.clu"),
]);
let input_mismatch = nu!(cwd: dirs.test(), pipeline("5 | split-by type"));
assert!(input_mismatch.err.contains("doesn't support int input"));
let only_supports = nu!(
cwd: dirs.test(), pipeline(
"
ls
| get name
| split-by type
"
));
assert!(only_supports.err.contains("Input type not supported"));
})
}

View File

@ -394,7 +394,7 @@ fn list_not_table_error() {
"#
));
assert!(actual.err.contains("can't convert"))
assert!(actual.err.contains("Input type not supported"))
}
#[test]

View File

@ -8,7 +8,7 @@ use nu_protocol::{
engine::{Closure, EngineState, Stack},
eval_base::Eval,
BlockId, Config, DataSource, IntoPipelineData, PipelineData, PipelineMetadata, ShellError,
Span, Type, Value, VarId, ENV_VARIABLE_ID,
Span, Value, VarId, ENV_VARIABLE_ID,
};
use nu_utils::IgnoreCaseExt;
use std::sync::Arc;
@ -65,24 +65,13 @@ pub fn eval_call<D: DebugContext>(
if let Some(arg) = call.positional_nth(param_idx) {
let result = eval_expression::<D>(engine_state, caller_stack, arg)?;
let param_type = param.shape.to_type();
if required && !result.get_type().is_subtype(&param_type) {
// need to check if result is an empty list, and param_type is table or list
// nushell needs to pass type checking for the case.
let empty_list_matches = result
.as_list()
.map(|l| {
l.is_empty() && matches!(param_type, Type::List(_) | Type::Table(_))
})
.unwrap_or(false);
if !empty_list_matches {
return Err(ShellError::CantConvert {
to_type: param.shape.to_type().to_string(),
from_type: result.get_type().to_string(),
span: result.span(),
help: None,
});
}
if required && !result.is_subtype_of(&param_type) {
return Err(ShellError::CantConvert {
to_type: param.shape.to_type().to_string(),
from_type: result.get_type().to_string(),
span: result.span(),
help: None,
});
}
callee_stack.add_var(var_id, result);
} else if let Some(value) = &param.default_value {

View File

@ -1009,6 +1009,8 @@ fn eval_call<D: DebugContext>(
let args_len = caller_stack.arguments.get_len(*args_base);
let decl = engine_state.get_decl(decl_id);
check_input_types(&input, decl.signature(), head)?;
// Set up redirect modes
let mut caller_stack = caller_stack.push_redirection(redirect_out.take(), redirect_err.take());
@ -1246,15 +1248,7 @@ fn gather_arguments(
/// Type check helper. Produces `CantConvert` error if `val` is not compatible with `ty`.
fn check_type(val: &Value, ty: &Type) -> Result<(), ShellError> {
if match val {
// An empty list is compatible with any list or table type
Value::List { vals, .. } if vals.is_empty() => {
matches!(ty, Type::Any | Type::List(_) | Type::Table(_))
}
// FIXME: the allocation that might be required here is not great, it would be nice to be
// able to just directly check whether a value is compatible with a type
_ => val.get_type().is_subtype(ty),
} {
if val.is_subtype_of(ty) {
Ok(())
} else {
Err(ShellError::CantConvert {
@ -1266,6 +1260,77 @@ fn check_type(val: &Value, ty: &Type) -> Result<(), ShellError> {
}
}
/// Type check pipeline input against command's input types
fn check_input_types(
input: &PipelineData,
signature: Signature,
head: Span,
) -> Result<(), ShellError> {
let io_types = signature.input_output_types;
// If a command doesn't have any input/output types, then treat command input type as any
if io_types.is_empty() {
return Ok(());
}
// If a command only has a nothing input type, then allow any input data
match io_types.first() {
Some((Type::Nothing, _)) if io_types.len() == 1 => {
return Ok(());
}
_ => (),
}
// Errors and custom values bypass input type checking
if matches!(
input,
PipelineData::Value(Value::Error { .. } | Value::Custom { .. }, ..)
) {
return Ok(());
}
// Check if the input type is compatible with *any* of the command's possible input types
if io_types
.iter()
.any(|(command_type, _)| input.is_subtype_of(command_type))
{
return Ok(());
}
let mut input_types = io_types
.iter()
.map(|(input, _)| input.to_string())
.collect::<Vec<String>>();
let expected_string = match input_types.len() {
0 => {
return Err(ShellError::NushellFailed {
msg: "Command input type strings is empty, despite being non-zero earlier"
.to_string(),
})
}
1 => input_types.swap_remove(0),
2 => input_types.join(" and "),
_ => {
input_types
.last_mut()
.expect("Vector with length >2 has no elements")
.insert_str(0, "and ");
input_types.join(", ")
}
};
match input {
PipelineData::Empty => Err(ShellError::PipelineEmpty { dst_span: head }),
_ => Err(ShellError::OnlySupportsThisInputType {
exp_input_type: expected_string,
wrong_type: input.get_type().to_string(),
dst_span: head,
src_span: input.span().unwrap_or(Span::unknown()),
}),
}
}
/// Get variable from [`Stack`] or [`EngineState`]
fn get_var(ctx: &EvalContext<'_>, var_id: VarId, span: Span) -> Result<Value, ShellError> {
match var_id {

View File

@ -122,6 +122,46 @@ impl PipelineData {
}
}
/// Determine if the `PipelineData` is a [subtype](https://en.wikipedia.org/wiki/Subtyping) of `other`.
///
/// This check makes no effort to collect a stream, so it may be a different result
/// than would be returned by calling [`Value::is_subtype()`] on the result of
/// [`.into_value()`](Self::into_value).
///
/// A `ListStream` acts the same as an empty list type: it is a subtype of any [`list`](Type::List)
/// or [`table`](Type::Table) type. After converting to a value, it may become a more specific type.
/// For example, a `ListStream` is a subtype of `list<int>` and `list<string>`.
/// If calling [`.into_value()`](Self::into_value) results in a `list<int>`,
/// then the value would not be a subtype of `list<string>`, in contrast to the original `ListStream`.
///
/// A `ByteStream` is a subtype of [`string`](Type::String) if it is coercible into a string.
/// Likewise, a `ByteStream` is a subtype of [`binary`](Type::Binary) if it is coercible into a binary value.
pub fn is_subtype_of(&self, other: &Type) -> bool {
match (self, other) {
(_, Type::Any) => true,
(PipelineData::Empty, Type::Nothing) => true,
(PipelineData::Value(val, ..), ty) => val.is_subtype_of(ty),
// a list stream could be a list with any type, including a table
(PipelineData::ListStream(..), Type::List(..) | Type::Table(..)) => true,
(PipelineData::ByteStream(stream, ..), Type::String)
if stream.type_().is_string_coercible() =>
{
true
}
(PipelineData::ByteStream(stream, ..), Type::Binary)
if stream.type_().is_binary_coercible() =>
{
true
}
(PipelineData::Empty, _) => false,
(PipelineData::ListStream(..), _) => false,
(PipelineData::ByteStream(..), _) => false,
}
}
pub fn into_value(self, span: Span) -> Result<Value, ShellError> {
match self {
PipelineData::Empty => Ok(Value::nothing(span)),

View File

@ -51,7 +51,11 @@ impl Type {
Self::Custom(name.into())
}
pub fn is_subtype(&self, other: &Type) -> bool {
/// Determine of the [`Type`] is a [subtype](https://en.wikipedia.org/wiki/Subtyping) of `other`.
///
/// This should only be used at parse-time. If you have a concrete [`Value`](crate::Value) or [`PipelineData`](crate::PipelineData),
/// you should use their respective [`is_subtype_of`] methods instead.
pub fn is_subtype_of(&self, other: &Type) -> bool {
// Structural subtyping
let is_subtype_collection = |this: &[(String, Type)], that: &[(String, Type)]| {
if this.is_empty() || that.is_empty() {
@ -61,7 +65,7 @@ impl Type {
} else {
that.iter().all(|(col_y, ty_y)| {
if let Some((_, ty_x)) = this.iter().find(|(col_x, _)| col_x == col_y) {
ty_x.is_subtype(ty_y)
ty_x.is_subtype_of(ty_y)
} else {
false
}
@ -74,7 +78,7 @@ impl Type {
(Type::Float, Type::Number) => true,
(Type::Int, Type::Number) => true,
(_, Type::Any) => true,
(Type::List(t), Type::List(u)) if t.is_subtype(u) => true, // List is covariant
(Type::List(t), Type::List(u)) if t.is_subtype_of(u) => true, // List is covariant
(Type::Record(this), Type::Record(that)) | (Type::Table(this), Type::Table(that)) => {
is_subtype_collection(this, that)
}
@ -227,21 +231,21 @@ mod tests {
#[test]
fn test_reflexivity() {
for ty in Type::iter() {
assert!(ty.is_subtype(&ty));
assert!(ty.is_subtype_of(&ty));
}
}
#[test]
fn test_any_is_top_type() {
for ty in Type::iter() {
assert!(ty.is_subtype(&Type::Any));
assert!(ty.is_subtype_of(&Type::Any));
}
}
#[test]
fn test_number_supertype() {
assert!(Type::Int.is_subtype(&Type::Number));
assert!(Type::Float.is_subtype(&Type::Number));
assert!(Type::Int.is_subtype_of(&Type::Number));
assert!(Type::Float.is_subtype_of(&Type::Number));
}
#[test]
@ -250,7 +254,7 @@ mod tests {
for ty2 in Type::iter() {
let list_ty1 = Type::List(Box::new(ty1.clone()));
let list_ty2 = Type::List(Box::new(ty2.clone()));
assert_eq!(list_ty1.is_subtype(&list_ty2), ty1.is_subtype(&ty2));
assert_eq!(list_ty1.is_subtype_of(&list_ty2), ty1.is_subtype_of(&ty2));
}
}
}

View File

@ -813,6 +813,66 @@ impl Value {
}
}
/// Determine of the [`Value`] is a [subtype](https://en.wikipedia.org/wiki/Subtyping) of `other`
///
/// If you have a [`Value`], this method should always be used over chaining [`Value::get_type`] with [`Type::is_subtype_of`](crate::Type::is_subtype_of).
///
/// This method is able to leverage that information encoded in a `Value` to provide more accurate
/// type comparison than if one were to collect the type into [`Type`](crate::Type) value with [`Value::get_type`].
///
/// Empty lists are considered subtypes of all list<T> types.
///
/// Lists of mixed records where some column is present in all record is a subtype of `table<column>`.
/// For example, `[{a: 1, b: 2}, {a: 1}]` is a subtype of `table<a: int>` (but not `table<a: int, b: int>`).
///
/// See also: [`PipelineData::is_subtype_of`](crate::PipelineData::is_subtype_of)
pub fn is_subtype_of(&self, other: &Type) -> bool {
// records are structurally typed
let record_compatible = |val: &Value, other: &[(String, Type)]| match val {
Value::Record { val, .. } => other
.iter()
.all(|(key, ty)| val.get(key).is_some_and(|inner| inner.is_subtype_of(ty))),
_ => false,
};
// All cases matched explicitly to ensure this does not accidentally allocate `Type` if any composite types are introduced in the future
match (self, other) {
(_, Type::Any) => true,
// `Type` allocation for scalar types is trivial
(
Value::Bool { .. }
| Value::Int { .. }
| Value::Float { .. }
| Value::String { .. }
| Value::Glob { .. }
| Value::Filesize { .. }
| Value::Duration { .. }
| Value::Date { .. }
| Value::Range { .. }
| Value::Closure { .. }
| Value::Error { .. }
| Value::Binary { .. }
| Value::CellPath { .. }
| Value::Nothing { .. },
_,
) => self.get_type().is_subtype_of(other),
// matching composite types
(val @ Value::Record { .. }, Type::Record(inner)) => record_compatible(val, inner),
(Value::List { vals, .. }, Type::List(inner)) => {
vals.iter().all(|val| val.is_subtype_of(inner))
}
(Value::List { vals, .. }, Type::Table(inner)) => {
vals.iter().all(|val| record_compatible(val, inner))
}
(Value::Custom { val, .. }, Type::Custom(inner)) => val.type_name() == **inner,
// non-matching composite types
(Value::Record { .. } | Value::List { .. } | Value::Custom { .. }, _) => false,
}
}
pub fn get_data_by_key(&self, name: &str) -> Option<Value> {
let span = self.span();
match self {
@ -3880,6 +3940,136 @@ mod tests {
}
}
mod is_subtype {
use crate::Type;
use super::*;
fn assert_subtype_equivalent(value: &Value, ty: &Type) {
assert_eq!(value.is_subtype_of(ty), value.get_type().is_subtype_of(ty));
}
#[test]
fn test_list() {
let ty_int_list = Type::list(Type::Int);
let ty_str_list = Type::list(Type::String);
let ty_any_list = Type::list(Type::Any);
let ty_list_list_int = Type::list(Type::list(Type::Int));
let list = Value::test_list(vec![
Value::test_int(1),
Value::test_int(2),
Value::test_int(3),
]);
assert_subtype_equivalent(&list, &ty_int_list);
assert_subtype_equivalent(&list, &ty_str_list);
assert_subtype_equivalent(&list, &ty_any_list);
let list = Value::test_list(vec![
Value::test_int(1),
Value::test_string("hi"),
Value::test_int(3),
]);
assert_subtype_equivalent(&list, &ty_int_list);
assert_subtype_equivalent(&list, &ty_str_list);
assert_subtype_equivalent(&list, &ty_any_list);
let list = Value::test_list(vec![Value::test_list(vec![Value::test_int(1)])]);
assert_subtype_equivalent(&list, &ty_list_list_int);
// The type of an empty lists is a subtype of any list or table type
let ty_table = Type::Table(Box::new([
("a".into(), Type::Int),
("b".into(), Type::Int),
("c".into(), Type::Int),
]));
let empty = Value::test_list(vec![]);
assert_subtype_equivalent(&empty, &ty_any_list);
assert!(empty.is_subtype_of(&ty_int_list));
assert!(empty.is_subtype_of(&ty_table));
}
#[test]
fn test_record() {
let ty_abc = Type::Record(Box::new([
("a".into(), Type::Int),
("b".into(), Type::Int),
("c".into(), Type::Int),
]));
let ty_ab = Type::Record(Box::new([("a".into(), Type::Int), ("b".into(), Type::Int)]));
let ty_inner = Type::Record(Box::new([("inner".into(), ty_abc.clone())]));
let record_abc = Value::test_record(record! {
"a" => Value::test_int(1),
"b" => Value::test_int(2),
"c" => Value::test_int(3),
});
let record_ab = Value::test_record(record! {
"a" => Value::test_int(1),
"b" => Value::test_int(2),
});
assert_subtype_equivalent(&record_abc, &ty_abc);
assert_subtype_equivalent(&record_abc, &ty_ab);
assert_subtype_equivalent(&record_ab, &ty_abc);
assert_subtype_equivalent(&record_ab, &ty_ab);
let record_inner = Value::test_record(record! {
"inner" => record_abc
});
assert_subtype_equivalent(&record_inner, &ty_inner);
}
#[test]
fn test_table() {
let ty_abc = Type::Table(Box::new([
("a".into(), Type::Int),
("b".into(), Type::Int),
("c".into(), Type::Int),
]));
let ty_ab = Type::Table(Box::new([("a".into(), Type::Int), ("b".into(), Type::Int)]));
let ty_list_any = Type::list(Type::Any);
let record_abc = Value::test_record(record! {
"a" => Value::test_int(1),
"b" => Value::test_int(2),
"c" => Value::test_int(3),
});
let record_ab = Value::test_record(record! {
"a" => Value::test_int(1),
"b" => Value::test_int(2),
});
let table_abc = Value::test_list(vec![record_abc.clone(), record_abc.clone()]);
let table_ab = Value::test_list(vec![record_ab.clone(), record_ab.clone()]);
assert_subtype_equivalent(&table_abc, &ty_abc);
assert_subtype_equivalent(&table_abc, &ty_ab);
assert_subtype_equivalent(&table_ab, &ty_abc);
assert_subtype_equivalent(&table_ab, &ty_ab);
assert_subtype_equivalent(&table_abc, &ty_list_any);
let table_mixed = Value::test_list(vec![record_abc.clone(), record_ab.clone()]);
assert_subtype_equivalent(&table_mixed, &ty_abc);
assert!(table_mixed.is_subtype_of(&ty_ab));
let ty_a = Type::Table(Box::new([("a".into(), Type::Any)]));
let table_mixed_types = Value::test_list(vec![
Value::test_record(record! {
"a" => Value::test_int(1),
}),
Value::test_record(record! {
"a" => Value::test_string("a"),
}),
]);
assert!(table_mixed_types.is_subtype_of(&ty_a));
}
}
mod into_string {
use chrono::{DateTime, FixedOffset};

View File

@ -28,7 +28,7 @@ def valid-annotations [] {
# Returns a table containing the list of function names together with their annotations (comments above the declaration)
def get-annotated [
file: path
]: path -> table<function_name: string, annotation: string> {
]: nothing -> table<function_name: string, annotation: string> {
let raw_file = (
open $file
| lines

View File

@ -37,7 +37,7 @@ def iter_intersperse [] {
let res = (1..4 | iter intersperse 0)
assert equal $res [1 0 2 0 3 0 4]
let res = (4 | iter intersperse 1)
let res = ([4] | iter intersperse 1)
assert equal $res [4]
}
@ -92,7 +92,7 @@ def iter_zip_with [] {
assert equal $res [3 5 7]
let res = (42 | iter zip-with [1 2 3] {|a, b| $a // $b})
let res = ([42] | iter zip-with [1 2 3] {|a, b| $a // $b})
assert equal $res [42]
let res = (2..5 | iter zip-with 4 {|a, b| $a * $b})

View File

@ -35,10 +35,13 @@ impl PluginCommand for ExprWhen {
SyntaxShape::Any,
"expression that will be applied when predicate is true",
)
.input_output_type(
Type::Custom("expression".into()),
Type::Custom("expression".into()),
)
.input_output_types(vec![
(Type::Nothing, Type::Custom("expression".into())),
(
Type::Custom("expression".into()),
Type::Custom("expression".into()),
),
])
.category(Category::Custom("expression".into()))
}

View File

@ -46,7 +46,7 @@ export def clippy [
^cargo clippy
--workspace
--exclude nu_plugin_*
--features ($features | str join ",")
--features ($features | default [] | str join ",")
--
-D warnings
-D clippy::unwrap_used
@ -62,7 +62,7 @@ export def clippy [
--tests
--workspace
--exclude nu_plugin_*
--features ($features | str join ",")
--features ($features | default [] | str join ",")
--
-D warnings
)
@ -96,13 +96,13 @@ export def test [
if $workspace {
^cargo nextest run --all
} else {
^cargo nextest run --features ($features | str join ",")
^cargo nextest run --features ($features | default [] | str join ",")
}
} else {
if $workspace {
^cargo test --workspace
} else {
^cargo test --features ($features | str join ",")
^cargo test --features ($features | default [] | str join ",")
}
}
}
@ -345,7 +345,7 @@ export def build [
...features: string@"nu-complete list features" # a space-separated list of feature to install with Nushell
--all # build all plugins with Nushell
] {
build-nushell ($features | str join ",")
build-nushell ($features | default [] | str join ",")
if not $all {
return
@ -384,7 +384,7 @@ export def install [
--all # install all plugins with Nushell
] {
touch crates/nu-cmd-lang/build.rs # needed to make sure `version` has the correct `commit_hash`
^cargo install --path . --features ($features | str join ",") --locked --force
^cargo install --path . --features ($features | default [] | str join ",") --locked --force
if not $all {
return
}