Edit README; Fix a test; Try tracing the compiler and VM
This commit is contained in:
parent
dc3cc13b12
commit
950c36d601
172
README.md
172
README.md
@ -48,16 +48,6 @@ fn fib (n: int) -> int {
|
|||||||
write_line(fib(25))
|
write_line(fib(25))
|
||||||
```
|
```
|
||||||
|
|
||||||
Dust is an interpreted language that is compiled to bytecode by a hand-written lexer and parser. It
|
|
||||||
uses a custom multi-threaded register-based virtual machine with separate-thread garbage collection.
|
|
||||||
Competing with the runtime performance of Rust or C++ *is not* a goal. But competing with the
|
|
||||||
approachability and simplicity of those languages *is* a goal. Dust *does* intend to be faster than
|
|
||||||
Python, Ruby and NodeJS while also offering a superior development experience and more reliable code
|
|
||||||
due to its static typing. Dust's development approach is informed by some books[^1] and academic
|
|
||||||
research[^4] as well as practical insight from papers[^2] written by language authors. See the
|
|
||||||
[Inspiration](README#Inspiration) section for more information or keep reading to learn about Dust's
|
|
||||||
features.
|
|
||||||
|
|
||||||
## Goals
|
## Goals
|
||||||
|
|
||||||
This project's goal is to deliver a language that not only *works* but that offers genunine value
|
This project's goal is to deliver a language that not only *works* but that offers genunine value
|
||||||
@ -94,56 +84,6 @@ first sentence, Dust's general aspirations are to be **fast**, **safe** and **ea
|
|||||||
- **Low Resource Usage** Despite its performance, Dust's use of memory and CPU power should be
|
- **Low Resource Usage** Despite its performance, Dust's use of memory and CPU power should be
|
||||||
conservative and predictable enough to accomodate a wide range of devices.
|
conservative and predictable enough to accomodate a wide range of devices.
|
||||||
|
|
||||||
These are the project's general design goals. There are many more implementation goals. Among them
|
|
||||||
are:
|
|
||||||
|
|
||||||
- Effortless Concurrency: Dust should offer an excellent experience for writing multi-threaded
|
|
||||||
programs. The language's native functions should offer an API for spawning threads, sending
|
|
||||||
messages and waiting for results. When using these features, Dust should be much faster than any
|
|
||||||
single-threaded language. However, Dust should be fast even when running on a single thread.
|
|
||||||
Single-threaded performce is the best predictor of multi-threaded performance so continuing to
|
|
||||||
optimize how each thread executes instructions, accesses memory and moves pointers is the best
|
|
||||||
way to ensure that Dust is fast in all scenarios.
|
|
||||||
- Embeddability: The library should be easy to use so that Dust can be built into other
|
|
||||||
applications. Dust should compile to WebAssembly and offer examples of how to use it in a web
|
|
||||||
application. The user should be able to query the VM for information about the program's state
|
|
||||||
and control the program's execution. It should be possible to view and modify the value of a
|
|
||||||
variable and inspect the call stack.
|
|
||||||
- Data Fluency: Dust's value type should support conversion to and from arbitrary data in formats
|
|
||||||
like JSON, YAML, TOML and CSV. Pulling data into a Dust program should be easy, with built-in
|
|
||||||
functions offering conversion for the most widely used formats.
|
|
||||||
- Portability: Dust should run on as many architectures and operating systems as possible. Using
|
|
||||||
fewer dependencies and avoiding platform-specific code will help Dust achieve this goal. The
|
|
||||||
Dust library should be available as a WebAssembly module.
|
|
||||||
- Developer Experience: Dust should be fun and easy to use. That implies easy installation and the
|
|
||||||
availability of tutorials and how-to guides. The CLI should be predictable and feature-rich,
|
|
||||||
with features that make it easy to write and debug Dust code like formatting, bytecode
|
|
||||||
disassembly and logging.
|
|
||||||
- Advanced Type System: Dust should implement composite types, aliases and generics. The type
|
|
||||||
system should use a descriptive syntax that is easy to understand. Dust's type system should be
|
|
||||||
static, meaning that types are checked before a program reaches the VM. Dust is not a
|
|
||||||
graduallly typed language, its VM is and should remain type-agnostic.
|
|
||||||
- Thorough Testing: Primarily, the output of Dust's compiler and VM should be tested with programs
|
|
||||||
that cover all of the language's features. The tests should be actively maintained and should be
|
|
||||||
changed frequently to reflect a growing project that is constantly discovering new optimizations
|
|
||||||
and opportunities for improvement.
|
|
||||||
|
|
||||||
## Project Status
|
|
||||||
|
|
||||||
This project is maintained by a single developer. For now, its primary home is on a private git
|
|
||||||
server. The GitHub mirror is updated automatically and should carry the latest branches. There are
|
|
||||||
no other contributors at this time but the project is open to feedback and should eventually accept
|
|
||||||
contributions.
|
|
||||||
|
|
||||||
For now, both the library API and the implementation details are freely changed and the CLI has not
|
|
||||||
been published. Dust is both an ambitious project and a continuous experiment in language design.
|
|
||||||
Features may be redesigned and reimplemented at will when they do not meet the project's performance
|
|
||||||
or usability goals. This approach maximizes the development experience as a learning opportunity and
|
|
||||||
enforces a high standard of quality but slows down the process of delivering features to users.
|
|
||||||
Eventually, Dust will reach a stable release and will be ready for general use. As the project
|
|
||||||
approaches this milestone, the experimental nature of the project will be reduced and a replaced
|
|
||||||
with a focus on stability and improvement.
|
|
||||||
|
|
||||||
## Language Overview
|
## Language Overview
|
||||||
|
|
||||||
This is a quick overview of Dust's syntax features. It skips over the aspects that are familiar to
|
This is a quick overview of Dust's syntax features. It skips over the aspects that are familiar to
|
||||||
@ -167,57 +107,6 @@ So while the syntax is by no means compatible, it is superficially similar, even
|
|||||||
syntax highlighting for Rust code works well with Dust code. This is not a design goal but a happy
|
syntax highlighting for Rust code works well with Dust code. This is not a design goal but a happy
|
||||||
coincidence.
|
coincidence.
|
||||||
|
|
||||||
### Semicolons
|
|
||||||
|
|
||||||
Dust borrowed Rust's approach to semicolons and their effect on evaluation and relaxed the rules to
|
|
||||||
accomated different styles of coding. Rust, isn't designed for command lines or REPLs but Dust could
|
|
||||||
be well-suited to those applications. Dust needs to work in a source file or in an ad-hoc one-liner
|
|
||||||
sent to the CLI. Thus, semicolons are optional in most cases.
|
|
||||||
|
|
||||||
There are two things you need to know about semicolons in Dust:
|
|
||||||
|
|
||||||
- Semicolons suppress the value of whatever they follow. The preceding statement or expression will
|
|
||||||
have the type `none` and will not evaluate to a value.
|
|
||||||
- If a semicolon does not change how the program runs, it is optional.
|
|
||||||
|
|
||||||
This example shows three statements with semicolons. The compiler knows that a `let` statement
|
|
||||||
cannot produce a value and will always have the type `none`. Thanks to static typing, it also knows
|
|
||||||
that the `write_line` function has no return value so the function call also has the type `none`.
|
|
||||||
Therefore, these semicolons are optional.
|
|
||||||
|
|
||||||
```rust
|
|
||||||
let a = 40;
|
|
||||||
let b = 2;
|
|
||||||
|
|
||||||
write_line("The answer is ", a + b);
|
|
||||||
```
|
|
||||||
|
|
||||||
Removing the semicolons does not alter the execution pattern or the return value.
|
|
||||||
|
|
||||||
```rust
|
|
||||||
let x = 10
|
|
||||||
let y = 3
|
|
||||||
|
|
||||||
write_line("The remainder is ", x % y)
|
|
||||||
```
|
|
||||||
|
|
||||||
The next example produces a compiler error because the `if` block returns a value of type `int` but
|
|
||||||
the `else` block does not return a value at all. Dust does not allow branches of the same `if/else`
|
|
||||||
statement to have different types. In this case, adding a semicolon after the `777` expression fixes
|
|
||||||
the error by supressing the value.
|
|
||||||
|
|
||||||
```rust
|
|
||||||
// !!! Compile Error !!!
|
|
||||||
let input = read_line()
|
|
||||||
let reward = if input == "42" {
|
|
||||||
write_line("You got it! Here's your reward.")
|
|
||||||
|
|
||||||
777 // <- We need a semicolon here
|
|
||||||
} else {
|
|
||||||
write_line(input, " is not the answer.")
|
|
||||||
}
|
|
||||||
```
|
|
||||||
|
|
||||||
### Statements and Expressions
|
### Statements and Expressions
|
||||||
|
|
||||||
Dust is composed of statements and expressions. If a statement ends in an expression without a
|
Dust is composed of statements and expressions. If a statement ends in an expression without a
|
||||||
@ -270,13 +159,54 @@ model is more accomodating without sacrificing expressiveness. In Rust, semicolo
|
|||||||
and *meaningful*, which provides excellent consistency but lacks flexibility. In JavaScript,
|
and *meaningful*, which provides excellent consistency but lacks flexibility. In JavaScript,
|
||||||
semicolons are *required* and *meaningless*, which is a source of confusion for many developers.
|
semicolons are *required* and *meaningless*, which is a source of confusion for many developers.
|
||||||
|
|
||||||
### Control Flow
|
Dust borrowed Rust's approach to semicolons and their effect on evaluation and relaxed the rules to
|
||||||
|
accomated different styles of coding. Rust isn't designed for command lines or REPLs but Dust is
|
||||||
|
well-suited to those applications. Dust needs to work in a source file or in an ad-hoc one-liner
|
||||||
|
sent to the CLI. Thus, semicolons are optional in most cases.
|
||||||
|
|
||||||
-- TODO --
|
There are two things you need to know about semicolons in Dust:
|
||||||
|
|
||||||
### Functions
|
- Semicolons suppress the value of whatever they follow. The preceding statement or expression will
|
||||||
|
have the type `none` and will not evaluate to a value.
|
||||||
|
- If a semicolon does not change how the program runs, it is optional.
|
||||||
|
|
||||||
-- TODO --
|
This example shows three statements with semicolons. The compiler knows that a `let` statement
|
||||||
|
cannot produce a value and will always have the type `none`. Thanks to static typing, it also knows
|
||||||
|
that the `write_line` function has no return value so the function call also has the type `none`.
|
||||||
|
Therefore, these semicolons are optional.
|
||||||
|
|
||||||
|
```rust
|
||||||
|
let a = 40;
|
||||||
|
let b = 2;
|
||||||
|
|
||||||
|
write_line("The answer is ", a + b);
|
||||||
|
```
|
||||||
|
|
||||||
|
Removing the semicolons does not alter the execution pattern or the return value.
|
||||||
|
|
||||||
|
```rust
|
||||||
|
let x = 10
|
||||||
|
let y = 3
|
||||||
|
|
||||||
|
write_line("The remainder is ", x % y)
|
||||||
|
```
|
||||||
|
|
||||||
|
The next example produces a compiler error because the `if` block returns a value of type `int` but
|
||||||
|
the `else` block does not return a value at all. Dust does not allow branches of the same `if/else`
|
||||||
|
statement to have different types. In this case, adding a semicolon after the `777` expression fixes
|
||||||
|
the error by supressing the value.
|
||||||
|
|
||||||
|
```rust
|
||||||
|
// !!! Compile Error !!!
|
||||||
|
let input = read_line()
|
||||||
|
let reward = if input == "42" {
|
||||||
|
write_line("You got it! Here's your reward.")
|
||||||
|
|
||||||
|
777 // <- We need a semicolon here
|
||||||
|
} else {
|
||||||
|
write_line(input, " is not the answer.")
|
||||||
|
}
|
||||||
|
```
|
||||||
|
|
||||||
#### Type System
|
#### Type System
|
||||||
|
|
||||||
@ -296,14 +226,6 @@ Dust *does* have a `none` type, which should not be confused for being `null`-li
|
|||||||
"unit" type in Rust, `none` exists as a type but not as a value. It indicates the lack of a value
|
"unit" type in Rust, `none` exists as a type but not as a value. It indicates the lack of a value
|
||||||
from a function, expression or statement. A variable cannot be assigned to `none`.
|
from a function, expression or statement. A variable cannot be assigned to `none`.
|
||||||
|
|
||||||
#### Immutability by Default
|
|
||||||
|
|
||||||
TODO
|
|
||||||
|
|
||||||
#### Memory Safety
|
|
||||||
|
|
||||||
TODO
|
|
||||||
|
|
||||||
### Basic Values
|
### Basic Values
|
||||||
|
|
||||||
Dust supports the following basic values:
|
Dust supports the following basic values:
|
||||||
@ -324,10 +246,6 @@ singular values. Shorter strings are stored on the stack while longer strings ar
|
|||||||
Dust offers built-in native functions that can manipulate strings by accessing their bytes or
|
Dust offers built-in native functions that can manipulate strings by accessing their bytes or
|
||||||
reading them as a sequence of characters.
|
reading them as a sequence of characters.
|
||||||
|
|
||||||
### Composite Values
|
|
||||||
|
|
||||||
TODO
|
|
||||||
|
|
||||||
## Previous Implementations
|
## Previous Implementations
|
||||||
|
|
||||||
Dust has gone through several iterations, each with its own design choices. It was originally
|
Dust has gone through several iterations, each with its own design choices. It was originally
|
||||||
|
@ -309,9 +309,6 @@ fn main() {
|
|||||||
input,
|
input,
|
||||||
}) = mode
|
}) = mode
|
||||||
{
|
{
|
||||||
let run_span = span!(Level::TRACE, "CLI Run Mode");
|
|
||||||
let _run_guard = run_span.enter();
|
|
||||||
|
|
||||||
let (source, file_name) = get_source_and_file_name(input);
|
let (source, file_name) = get_source_and_file_name(input);
|
||||||
let lexer = Lexer::new(&source);
|
let lexer = Lexer::new(&source);
|
||||||
let mut compiler = match Compiler::new(lexer) {
|
let mut compiler = match Compiler::new(lexer) {
|
||||||
|
@ -25,7 +25,7 @@ mod type_checks;
|
|||||||
|
|
||||||
pub use error::CompileError;
|
pub use error::CompileError;
|
||||||
use parse_rule::{ParseRule, Precedence};
|
use parse_rule::{ParseRule, Precedence};
|
||||||
use tracing::{debug, info};
|
use tracing::{debug, info, span, Level};
|
||||||
use type_checks::{check_math_type, check_math_types};
|
use type_checks::{check_math_type, check_math_types};
|
||||||
|
|
||||||
use std::mem::replace;
|
use std::mem::replace;
|
||||||
@ -135,12 +135,6 @@ impl<'src> Compiler<'src> {
|
|||||||
pub fn new(mut lexer: Lexer<'src>) -> Result<Self, CompileError> {
|
pub fn new(mut lexer: Lexer<'src>) -> Result<Self, CompileError> {
|
||||||
let (current_token, current_position) = lexer.next_token()?;
|
let (current_token, current_position) = lexer.next_token()?;
|
||||||
|
|
||||||
info!(
|
|
||||||
"Begin chunk with {} at {}",
|
|
||||||
current_token.to_string(),
|
|
||||||
current_position.to_string()
|
|
||||||
);
|
|
||||||
|
|
||||||
Ok(Compiler {
|
Ok(Compiler {
|
||||||
function_name: None,
|
function_name: None,
|
||||||
r#type: FunctionType {
|
r#type: FunctionType {
|
||||||
@ -172,8 +166,6 @@ impl<'src> Compiler<'src> {
|
|||||||
/// will allow [`Compiler::function_name`] to be both the name used for recursive calls and the
|
/// will allow [`Compiler::function_name`] to be both the name used for recursive calls and the
|
||||||
/// name of the function when it is compiled. The name can later be seen in the VM's call stack.
|
/// name of the function when it is compiled. The name can later be seen in the VM's call stack.
|
||||||
pub fn finish(self, name: Option<impl Into<DustString>>) -> Chunk {
|
pub fn finish(self, name: Option<impl Into<DustString>>) -> Chunk {
|
||||||
info!("End chunk");
|
|
||||||
|
|
||||||
let (instructions, positions): (SmallVec<[Instruction; 32]>, SmallVec<[Span; 32]>) = self
|
let (instructions, positions): (SmallVec<[Instruction; 32]>, SmallVec<[Span; 32]>) = self
|
||||||
.instructions
|
.instructions
|
||||||
.into_iter()
|
.into_iter()
|
||||||
@ -205,6 +197,15 @@ impl<'src> Compiler<'src> {
|
|||||||
/// [`CompileError`] if any are found. After calling this function, check its return value for
|
/// [`CompileError`] if any are found. After calling this function, check its return value for
|
||||||
/// an error, then call [`Compiler::finish`] to get the compiled chunk.
|
/// an error, then call [`Compiler::finish`] to get the compiled chunk.
|
||||||
pub fn compile(&mut self) -> Result<(), CompileError> {
|
pub fn compile(&mut self) -> Result<(), CompileError> {
|
||||||
|
let span = span!(Level::INFO, "Compiling");
|
||||||
|
let _enter = span.enter();
|
||||||
|
|
||||||
|
info!(
|
||||||
|
"Begin chunk with {} at {}",
|
||||||
|
self.current_token.to_string(),
|
||||||
|
self.current_position.to_string()
|
||||||
|
);
|
||||||
|
|
||||||
loop {
|
loop {
|
||||||
self.parse(Precedence::None)?;
|
self.parse(Precedence::None)?;
|
||||||
|
|
||||||
@ -219,6 +220,8 @@ impl<'src> Compiler<'src> {
|
|||||||
}
|
}
|
||||||
}
|
}
|
||||||
|
|
||||||
|
info!("End chunk");
|
||||||
|
|
||||||
Ok(())
|
Ok(())
|
||||||
}
|
}
|
||||||
|
|
||||||
|
@ -16,6 +16,7 @@ pub use error::VmError;
|
|||||||
pub use record::Record;
|
pub use record::Record;
|
||||||
pub use run_action::RunAction;
|
pub use run_action::RunAction;
|
||||||
pub use thread::{Thread, ThreadSignal};
|
pub use thread::{Thread, ThreadSignal};
|
||||||
|
use tracing::{span, Level};
|
||||||
|
|
||||||
use crate::{compile, Chunk, DustError, Value};
|
use crate::{compile, Chunk, DustError, Value};
|
||||||
|
|
||||||
@ -40,6 +41,9 @@ impl Vm {
|
|||||||
}
|
}
|
||||||
|
|
||||||
pub fn run(mut self) -> Option<Value> {
|
pub fn run(mut self) -> Option<Value> {
|
||||||
|
let span = span!(Level::INFO, "Running");
|
||||||
|
let _enter = span.enter();
|
||||||
|
|
||||||
if self.threads.len() == 1 {
|
if self.threads.len() == 1 {
|
||||||
return self.threads[0].run();
|
return self.threads[0].run();
|
||||||
}
|
}
|
||||||
|
@ -1,4 +1,5 @@
|
|||||||
use dust_lang::*;
|
use dust_lang::*;
|
||||||
|
use smallvec::smallvec;
|
||||||
|
|
||||||
#[test]
|
#[test]
|
||||||
fn add_assign() {
|
fn add_assign() {
|
||||||
@ -13,23 +14,18 @@ fn add_assign() {
|
|||||||
value_parameters: None,
|
value_parameters: None,
|
||||||
return_type: Type::Integer,
|
return_type: Type::Integer,
|
||||||
},
|
},
|
||||||
vec![
|
smallvec![
|
||||||
(Instruction::load_constant(0, 0, false), Span(12, 13)),
|
Instruction::load_constant(0, 0, false),
|
||||||
(
|
|
||||||
Instruction::add(0, Argument::Register(0), Argument::Constant(2)),
|
Instruction::add(0, Argument::Register(0), Argument::Constant(2)),
|
||||||
Span(17, 19)
|
Instruction::get_local(1, 0),
|
||||||
),
|
Instruction::r#return(true)
|
||||||
(Instruction::get_local(1, 0), Span(23, 24)),
|
|
||||||
(Instruction::r#return(true), Span(24, 24))
|
|
||||||
],
|
],
|
||||||
vec![
|
smallvec![Span(12, 13), Span(17, 19), Span(23, 24), Span(24, 24)],
|
||||||
ConcreteValue::Integer(1),
|
smallvec![Value::integer(1), Value::string("a"), Value::integer(2)],
|
||||||
ConcreteValue::string("a"),
|
smallvec![Local::new(1, 0, true, Scope::default())],
|
||||||
ConcreteValue::Integer(2)
|
vec![]
|
||||||
],
|
|
||||||
vec![Local::new(1, 0, true, Scope::default())]
|
|
||||||
))
|
))
|
||||||
);
|
);
|
||||||
|
|
||||||
assert_eq!(run(source), Ok(Some(ConcreteValue::Integer(3))));
|
assert_eq!(run(source), Ok(Some(Value::integer(3))));
|
||||||
}
|
}
|
||||||
|
Loading…
Reference in New Issue
Block a user