函数式编程的Rust之旅:闭包与迭代器的深入解析与实践在Rust编程语言中,闭包(Closures)和迭代器(Iterators)是两大利器,它们不仅体现了函数式编程的优雅,还为开发者提供了强大的抽象能力和代码复用性。无论你是Rust新手还是有一定经验的开发者,理解和掌握闭包与迭代器都能让你的代码
在Rust编程语言中,闭包(Closures)和迭代器(Iterators)是两大利器,它们不仅体现了函数式编程的优雅,还为开发者提供了强大的抽象能力和代码复用性。无论你是Rust新手还是有一定经验的开发者,理解和掌握闭包与迭代器都能让你的代码更简洁、高效。本文将带你深入探索闭包和迭代器的核心概念、用法及优化技巧,并通过一个实际项目展示它们的威力。让我们一起开启这场Rust的函数式编程之旅吧!
本文从闭包的基础定义入手,讲解了其类型推断、捕获环境及抽象行为的创建方法,随后深入剖析迭代器的Iterator trait及next方法,揭示其懒惰特性和性能优势。我们还将探讨如何结合闭包与迭代器优化代码,并通过改进 minigrep 项目展示其实战价值。最后,对闭包和迭代器的运行时性能进行分析,帮助你更好地在项目中应用这些特性,提升代码质量与效率。
闭包是Rust中可以捕获环境的匿名函数,具有以下特点:
~/rust
➜ cargo new closure
Created binary (application) `closure` package
~/rust
➜ cd closure
closure on master [?] via 🦀 1.67.1
➜ c
closure on master [?] via 🦀 1.67.1
➜
use std::thread;
use std::time::Duration;
fn main() {
let simulated_user_specified_value = 10;
let simulated_random_number = 7;
generate_workout(simulated_user_specified_value, simulated_random_number);
}
fn simnulated_expensive_calculation(intensity: u32) -> u32 {
println!("calculating slowly ...");
thread::sleep(Duration::from_secs(2));
intensity
}
fn generate_workout(intensity: u32, random_number: u32) {
if intensity < 25 {
println!("Today, do {} pushups!", simnulated_expensive_calculation(intensity));
println!("Next, do {} situps!", simnulated_expensive_calculation(intensity));
} else {
if random_number == 3 {
println!("Take a break today! Remember to stay hydrated!");
} else {
println!("Today, run for {} minutes!", simnulated_expensive_calculation(intensity));
}
}
}
未优化版本:每次调用都重复计算,效率低下。
use std::thread;
use std::time::Duration;
fn main() {
let simulated_user_specified_value = 10;
let simulated_random_number = 7;
generate_workout(simulated_user_specified_value, simulated_random_number);
}
fn simnulated_expensive_calculation(intensity: u32) -> u32 {
println!("calculating slowly ...");
thread::sleep(Duration::from_secs(2));
intensity
}
fn generate_workout(intensity: u32, random_number: u32) {
let expensive_result = simnulated_expensive_calculation(intensity);
if intensity < 25 {
println!("Today, do {} pushups!", expensive_result);
println!("Next, do {} situps!", expensive_result);
} else {
if random_number == 3 {
println!("Take a break today! Remember to stay hydrated!");
} else {
println!("Today, run for {} minutes!", expensive_result);
}
}
}
闭包优化版:将计算逻辑封装为闭包,按需调用。
use std::thread;
use std::time::Duration;
fn main() {
let simulated_user_specified_value = 10;
let simulated_random_number = 7;
generate_workout(simulated_user_specified_value, simulated_random_number);
}
fn generate_workout(intensity: u32, random_number: u32) {
let expensive_closure = |num| {
println!("calculating slowly ...");
thread::sleep(Duration::from_secs(2));
num
};
if intensity < 25 {
println!("Today, do {} pushups!", expensive_closure(intensity));
println!("Next, do {} situps!", expensive_closure(intensity));
} else {
if random_number == 3 {
println!("Take a break today! Remember to stay hydrated!");
} else {
println!("Today, run for {} minutes!", expensive_closure(intensity));
}
}
}
Rust编译器能自动推断闭包参数和返回值的类型,但也可手动标注:
let expensive_closure = |num: u32| -> u32 {
println!("calculating slowly ...");
thread::sleep(Duration::from_secs(2));
num
};
函数与闭包对比
fn add_one_v1(x: u32) -> u32 { x + 1 } // 函数
let add_one_v2 = |x: u32| -> u32 { x + 1 }; // 闭包
let add_one_v3 = |x| { x + 1 }; // 闭包
let add_one_v4 = |x| x + 1 ; // 简洁闭包
fn main() {
let example_closure = |x| x;
let s = example_closure(String::from("hello"));
let n = example_closure(5) // 报错
}
注意:闭包类型一旦推断固定,不能更改参数类型。
为避免重复计算,我们可以用结构体缓存闭包结果:
use std::thread;
use std::time::Duration;
struct Cacher<T>
where
T: Fn(u32) -> u32,
{
calculation: T,
value: Option<u32>,
}
impl<T> Cacher<T>
where
T: Fn(u32) -> u32,
{
fn new(calculation: T) -> Cacher<T> {
Cacher {
calculation,
value: None,
}
}
fn value(&mut self, arg: u32) -> u32 {
match self.value {
Some(v) => v,
None => {
let v = (self.calculation)(arg);
self.value = Some(v);
v
}
}
}
}
fn main() {
let simulated_user_specified_value = 10;
let simulated_random_number = 7;
generate_workout(simulated_user_specified_value, simulated_random_number);
}
fn generate_workout(intensity: u32, random_number: u32) {
let mut expensive_closure = Cacher::new(|num| {
println!("calculating slowly ...");
thread::sleep(Duration::from_secs(2));
num
});
if intensity < 25 {
println!("Today, do {} pushups!", expensive_closure.value(intensity));
println!("Next, do {} situps!", expensive_closure.value(intensity));
} else {
if random_number == 3 {
println!("Take a break today! Remember to stay hydrated!");
} else {
println!("Today, run for {} minutes!", expensive_closure.value(intensity));
}
}
}
use std::thread;
use std::time::Duration;
struct Cacher<T>
where
T: Fn(u32) -> u32,
{
calculation: T,
value: Option<u32>,
}
impl<T> Cacher<T>
where
T: Fn(u32) -> u32,
{
fn new(calculation: T) -> Cacher<T> {
Cacher {
calculation,
value: None,
}
}
fn value(&mut self, arg: u32) -> u32 {
match self.value {
Some(v) => v,
None => {
let v = (self.calculation)(arg);
self.value = Some(v);
v
}
}
}
}
fn main() {
let simulated_user_specified_value = 10;
let simulated_random_number = 7;
generate_workout(simulated_user_specified_value, simulated_random_number);
}
fn generate_workout(intensity: u32, random_number: u32) {
let mut expensive_closure = Cacher::new(|num| {
println!("calculating slowly ...");
thread::sleep(Duration::from_secs(2));
num
});
if intensity < 25 {
println!("Today, do {} pushups!", expensive_closure.value(intensity));
println!("Next, do {} situps!", expensive_closure.value(intensity));
} else {
if random_number == 3 {
println!("Take a break today! Remember to stay hydrated!");
} else {
println!(
"Today, run for {} minutes!",
expensive_closure.value(intensity)
);
}
}
}
#[cfg(test)]
mod tests {
#[test]
fn call_with_different_values() {
let mut c = super::Cacher::new(|a| a);
let v1 = c.value(1);
let v2 = c.value(2);
assert_eq!(v2, 2);
}
}
应用:只计算一次,结果可复用,大幅提升性能。
闭包能访问定义时的变量:
fn main() {
let x = 4;
let equal_to_x = |z| z == x; // 捕获x
let y = 4;
assert!(equal_to_x(y));
}
fn main() {
let x = vec![1, 2, 3];
let equal_to_x = move |z| z == x;
println!("can't use x here: {:?}", x); // 报错
let y = vec![1, 2, 3];
assert!(equal_to_x(y))
}
move关键字:强制转移所有权,常用于线程。
迭代器负责遍历序列,Rust中是懒惰的,需显式调用才会执行。
fn main() {
let v1 = vec![1, 2, 3];
let v1_iter = v1.iter(); // 创建迭代器,未执行
for val in v1_iter {
pringln!("Got: {}", val); // 遍历时执行
}
}
pub trait Iterator {
type item;
fn next(&mut self) -> Option<Self::Item>;
// methods with default implementations elided
}
Type Item 和 Self::Item 定义了与此该 Trait 关联的类型。
Iterator trait 仅要求实现一个方法:next
next:
可直接在迭代器上调用 next 方法
#[cfg(test)]
mod tests {
#[test]
fn iterator_demonstration() {
let v1 = vec![1, 2, 3];
let mut v1_iter = v1.iter();
assert_eq!(v1_iter.next(), Some(&1));
assert_eq!(v1_iter.next(), Some(&2));
assert_eq!(v1_iter.next(), Some(&3));
}
}
消耗型方法 如sum:耗尽迭代器并返回结果。
#[cfg(test)]
mod tests {
#[test]
fn iterator_sum() {
let v1 = vec![1, 2, 3];
let v1_iter = v1,iter();
let total: i32 = v1_iter.sum(); // 6
assert_eq!(total, 6);
}
}
产生型方法 如map:生成新迭代器。
#[cfg(test)]
mod tests {
#[test]
fn iterator_sum() {
let v1: Vec<i32> = vec![1, 2, 3];
let v2: Vec<_> = v1.iter().map(|x| x + 1).collect(); // [2, 3, 4]
assert_eq!(v2, vec![2, 3, 4]);
}
}
filter示例:筛选符合条件的元素。
#[derive(PartialEq, Debug)]
struct Shoe {
size: u32,
style: String,
}
fn shoes_in_my_size(shoes: Vec<Shoe>, shoe_size: u32) -> Vec<Shoe> {
shoes.into_iter().filter(|x| x.size == shoe_size).collect()
}
#[test]
fn filter_by_size() {
let shoes = vec![
Shoe {
size: 10,
style: String::from("sneaker"),
},
Shoe {
size: 13,
style: String::from("sandal"),
},
Shoe {
size: 10,
style: String::from("boot").
},
];
let in_my_size = shoes_in_my_size(shoes, 10);
assert_eq!(in_my_size, vec![
Shoe {
size: 10,
style: String::from("sneaker")
},
Shoe {
size: 10,
style: String::from("boot")
},
]);
}
struct Counter {
count: u32,
}
impl Counter {
fn new() -> Counter {
Counter { count: 0 }
}
}
impl Iterator for Counter {
type Item = u32;
fn next(&mut self) -> Option<Self::Item> {
if self.count < 5 {
self.count += 1;
Some(self.count)
} else {
None
}
}
}
#[test]
fn calling_next_directly() {
let mut counter = Counter::new();
assert_eq!(counter.next(), Some(1));
assert_eq!(counter.next(), Some(2));
assert_eq!(counter.next(), Some(3));
assert_eq!(counter.next(), Some(4));
assert_eq!(counter.next(), Some(5));
assert_eq!(counter.next(), None);
}
#[test]
fn using_other_iterator_trait_methods() {
let sum: u32 = Counter::new() // 1 2 3 4 5
.zip(Counter::new().skip(1)) // 2 3 4 5 None
.map(|(a, b)| a * b) // 2 6 12 20
.filter(|x| x % 3 == 0) // 6 12
.sum(); // 6 + 12 = 18
assert_eq!(18, sum);
}
使用迭代器和闭包简化search函数:
use minigrep::Config;
use std::env;
use std::process;
fn main() {
let config = Config::new(env::args()).unwrap_or_else(|err| {
eprintln!("Problem parsing arguments: {}", err);
process::exit(1);
});
if let Err(e) = minigrep::run(config) {
eprintln!("Application error: {}", e);
process::exit(1);
}
}
use std::env;
use std::error::Error;
use std::fs;
pub fn run(config: Config) -> Result<(), Box<dyn Error>> {
let contents = fs::read_to_string(config.filename)?;
let results = if config.case_sensitive {
search(&config.query, &contents)
} else {
search_case_insensitive(&config.query, &contents)
};
for line in results {
println!("line: {}", line);
}
// println!("With text:\n{}", contents);
// println!("query: {:?}", config.query);
Ok(())
}
pub struct Config {
pub query: String,
pub filename: String,
pub case_sensitive: bool,
}
impl Config {
pub fn new(mut args: std::env::Args) -> Result<Config, &'static str> {
if args.len() < 3 {
return Err("not enough arguments");
}
args.next();
let query = match args.next() {
Some(arg) => arg,
None => return Err("Didn't get a query string"),
};
let filename = match args.next() {
Some(arg) => arg,
None => return Err("Didn't get a file name"),
};
let case_sensitive = env::var("CASE_INSENSITIVE").is_err();
Ok(Config {
query,
filename,
case_sensitive,
})
}
}
pub fn search<'a>(query: &str, contents: &'a str) -> Vec<&'a str> {
// let mut results = Vec::new();
// for line in contents.lines() {
// if line.contains(query) {
// results.push(line);
// }
// }
// results
contents
.lines()
.filter(|line| line.contains(query))
.collect()
}
pub fn search_case_insensitive<'a>(query: &str, contents: &'a str) -> Vec<&'a str> {
// let mut results = Vec::new();
// let query = query.to_lowercase();
// for line in contents.lines() {
// if line.to_lowercase().contains(&query) {
// results.push(line);
// }
// }
// results
contents
.lines()
.filter(|line| line.to_lowercase().contains(&query.to_lowercase()))
.collect()
}
#[cfg(test)]
mod tests {
use super::*;
#[test]
// fn one_result() {
// let query = "duct";
// let contents = "\
// Rust:
// safe, fast, productive.
// Pick three.";
// assert_eq!(vec!["safe, fast, productive."], search(query, contents))
// }
fn case_sensitive() {
let query = "duct";
let contents = "\
Rust:
safe, fast, productive.
Pick three.
Duct tape.";
assert_eq!(vec!["safe, fast, productive."], search(query, contents))
}
#[test]
fn case_insensitive() {
let query = "rUsT";
let contents = "\
Rust:
safe, fase, productive.
Pick three.
Trust me.";
assert_eq!(
vec!["Rust:", "Trust me."],
search_case_insensitive(query, contents)
)
}
}
循环 vs 迭代器:迭代器因零开销抽象(Zero-Cost Abstraction)往往更快。
test bench_search_for ... bench: 19,620,300 ns/iter (+/- 915,700)
test bench_search_iter ... bench: 19,234,900 ns/iter (+/- 657,200)
let buffer: &mut [i32];
let coefficients: [i64; 12];
let qlp_shift: i16;
for i in 12..buffer.len() {
let prediction = coefficients.iter().zip(&buffer[i - 12..i]).map(|(&c, &s)| c * s as i64).sum::<i64>() >> qlp_shift;
let delta = buffer[i];
buffer[i] = prediction as i32 + delta;
}
闭包和迭代器是Rust中不可或缺的函数式编程工具,它们以高度的抽象和灵活性赋予开发者编写优雅、高效代码的能力。通过本文的学习,你不仅能理解闭包如何捕获环境、迭代器如何懒惰执行,还能在实际项目中运用这些特性优化性能。无论是简化逻辑还是提升运行效率,闭包与迭代器都能成为你的得力助手。希望你在Rust的编程旅途中,能灵活运用这些工具,写出更出色的代码!
如果觉得我的文章对您有用,请随意打赏。你的支持将鼓励我继续创作!