Struct bumpalo::collections::string::String

source ·

pub struct String<'bump> { /* private fields */ }

Expand description

A UTF-8 encoded, growable string.

The String type is the most common string type that has ownership over the contents of the string. It has a close relationship with its borrowed counterpart, the primitive str.

Examples

You can create a String from a literal string with String::from_str_in:

use bumpalo::{Bump, collections::String};

let b = Bump::new();

let hello = String::from_str_in("Hello, world!", &b);

You can append a char to a String with the push method, and append a &str with the push_str method:

use bumpalo::{Bump, collections::String};

let b = Bump::new();

let mut hello = String::from_str_in("Hello, ", &b);

hello.push('w');
hello.push_str("orld!");

If you have a vector of UTF-8 bytes, you can create a String from it with the from_utf8 method:

use bumpalo::{Bump, collections::String};

let b = Bump::new();

// some bytes, in a vector
let sparkle_heart = bumpalo::vec![in &b; 240, 159, 146, 150];

// We know these bytes are valid, so we'll use `unwrap()`.
let sparkle_heart = String::from_utf8(sparkle_heart).unwrap();

assert_eq!("💖", sparkle_heart);

Deref

Strings implement Deref<Target = str>, and so inherit all of str’s methods. In addition, this means that you can pass a String to a function which takes a &str by using an ampersand (&):

use bumpalo::{Bump, collections::String};

let b = Bump::new();

fn takes_str(s: &str) { }

let s = String::from_str_in("Hello", &b);

takes_str(&s);

This will create a &str from the String and pass it in. This conversion is very inexpensive, and so generally, functions will accept &strs as arguments unless they need a String for some specific reason.

In certain cases Rust doesn’t have enough information to make this conversion, known as Deref coercion. In the following example a string slice &'a str implements the trait TraitExample, and the function example_func takes anything that implements the trait. In this case Rust would need to make two implicit conversions, which Rust doesn’t have the means to do. For that reason, the following example will not compile.

use bumpalo::{Bump, collections::String};

trait TraitExample {}

impl<'a> TraitExample for &'a str {}

fn example_func<A: TraitExample>(example_arg: A) {}

let b = Bump::new();
let example_string = String::from_str_in("example_string", &b);
example_func(&example_string);

There are two options that would work instead. The first would be to change the line example_func(&example_string); to example_func(example_string.as_str());, using the method as_str() to explicitly extract the string slice containing the string. The second way changes example_func(&example_string); to example_func(&*example_string);. In this case we are dereferencing a String to a str, then referencing the str back to &str. The second way is more idiomatic, however both work to do the conversion explicitly rather than relying on the implicit conversion.

Representation

A String is made up of three components: a pointer to some bytes, a length, and a capacity. The pointer points to an internal buffer String uses to store its data. The length is the number of bytes currently stored in the buffer, and the capacity is the size of the buffer in bytes. As such, the length will always be less than or equal to the capacity.

This buffer is always stored on the heap.

You can look at these with the as_ptr, len, and capacity methods:

use bumpalo::{Bump, collections::String};
use std::mem;

let b = Bump::new();

let mut story = String::from_str_in("Once upon a time...", &b);

let ptr = story.as_mut_ptr();
let len = story.len();
let capacity = story.capacity();

// story has nineteen bytes
assert_eq!(19, len);

// Now that we have our parts, we throw the story away.
mem::forget(story);

// We can re-build a String out of ptr, len, and capacity. This is all
// unsafe because we are responsible for making sure the components are
// valid:
let s = unsafe { String::from_raw_parts_in(ptr, len, capacity, &b) } ;

assert_eq!(String::from_str_in("Once upon a time...", &b), s);

If a String has enough capacity, adding elements to it will not re-allocate. For example, consider this program:

use bumpalo::{Bump, collections::String};

let b = Bump::new();

let mut s = String::new_in(&b);

println!("{}", s.capacity());

for _ in 0..5 {
    s.push_str("hello");
    println!("{}", s.capacity());
}

This will output the following:

At first, we have no memory allocated at all, but as we append to the string, it increases its capacity appropriately. If we instead use the with_capacity_in method to allocate the correct capacity initially:

use bumpalo::{Bump, collections::String};

let b = Bump::new();

let mut s = String::with_capacity_in(25, &b);

println!("{}", s.capacity());

for _ in 0..5 {
    s.push_str("hello");
    println!("{}", s.capacity());
}

We end up with a different output:

Here, there’s no need to allocate more memory inside the loop.

Struct bumpalo::collections::string::String

Implementations§

impl<'bump> String<'bump>

pub fn new_in(bump: &'bump Bump) -> String<'bump>

pub fn with_capacity_in(capacity: usize, bump: &'bump Bump) -> String<'bump>

pub fn from_utf8( vec: Vec<'bump, u8> ) -> Result<String<'bump>, FromUtf8Error<'bump>>

pub fn from_utf8_lossy_in(v: &[u8], bump: &'bump Bump) -> String<'bump>

pub fn from_utf16_in( v: &[u16], bump: &'bump Bump ) -> Result<String<'bump>, FromUtf16Error>

pub fn from_str_in(s: &str, bump: &'bump Bump) -> String<'bump>

pub fn from_iter_in<I: IntoIterator<Item = char>>( iter: I, bump: &'bump Bump ) -> String<'bump>

pub unsafe fn from_raw_parts_in( buf: *mut u8, length: usize, capacity: usize, bump: &'bump Bump ) -> String<'bump>

pub unsafe fn from_utf8_unchecked(bytes: Vec<'bump, u8>) -> String<'bump>

pub fn bump(&self) -> &'bump Bump

pub fn into_bytes(self) -> Vec<'bump, u8>

pub fn into_bump_str(self) -> &'bump str

pub fn as_str(&self) -> &str

pub fn as_mut_str(&mut self) -> &mut str

pub fn push_str(&mut self, string: &str)

pub fn capacity(&self) -> usize

pub fn reserve(&mut self, additional: usize)

pub fn reserve_exact(&mut self, additional: usize)

pub fn shrink_to_fit(&mut self)

pub fn push(&mut self, ch: char)

pub fn as_bytes(&self) -> &[u8]

pub fn truncate(&mut self, new_len: usize)

pub fn pop(&mut self) -> Option<char>

pub fn remove(&mut self, idx: usize) -> char

pub fn retain<F>(&mut self, f: F)where F: FnMut(char) -> bool,

pub fn insert(&mut self, idx: usize, ch: char)

pub fn insert_str(&mut self, idx: usize, string: &str)

pub unsafe fn as_mut_vec(&mut self) -> &mut Vec<'bump, u8>

pub fn len(&self) -> usize

pub fn is_empty(&self) -> bool

pub fn split_off(&mut self, at: usize) -> String<'bump>

pub fn clear(&mut self)

pub fn drain<'a, R>(&'a mut self, range: R) -> Drain<'a, 'bump> ⓘwhere R: RangeBounds<usize>,

pub fn replace_range<R>(&mut self, range: R, replace_with: &str)where R: RangeBounds<usize>,

Methods from Deref<Target = str>§

pub fn len(&self) -> usize

pub fn is_empty(&self) -> bool

pub fn is_char_boundary(&self, index: usize) -> bool

pub fn floor_char_boundary(&self, index: usize) -> usize

pub fn ceil_char_boundary(&self, index: usize) -> usize

pub fn as_bytes(&self) -> &[u8]

pub unsafe fn as_bytes_mut(&mut self) -> &mut [u8]

pub fn as_ptr(&self) -> *const u8

pub fn as_mut_ptr(&mut self) -> *mut u8

pub fn get<I>(&self, i: I) -> Option<&<I as SliceIndex<str>>::Output>where I: SliceIndex<str>,

pub fn get_mut<I>( &mut self, i: I ) -> Option<&mut <I as SliceIndex<str>>::Output>where I: SliceIndex<str>,

pub unsafe fn get_unchecked<I>(&self, i: I) -> &<I as SliceIndex<str>>::Outputwhere I: SliceIndex<str>,

pub unsafe fn get_unchecked_mut<I>( &mut self, i: I ) -> &mut <I as SliceIndex<str>>::Outputwhere I: SliceIndex<str>,

pub unsafe fn slice_unchecked(&self, begin: usize, end: usize) -> &str

pub unsafe fn slice_mut_unchecked( &mut self, begin: usize, end: usize ) -> &mut str

pub fn split_at(&self, mid: usize) -> (&str, &str)

pub fn split_at_mut(&mut self, mid: usize) -> (&mut str, &mut str)

pub fn chars(&self) -> Chars<'_>

pub fn char_indices(&self) -> CharIndices<'_>

pub fn bytes(&self) -> Bytes<'_>

pub fn split_whitespace(&self) -> SplitWhitespace<'_>

pub fn split_ascii_whitespace(&self) -> SplitAsciiWhitespace<'_>

pub fn lines(&self) -> Lines<'_>

pub fn lines_any(&self) -> LinesAny<'_>

pub fn encode_utf16(&self) -> EncodeUtf16<'_>