tar-no-std/README.md

60 lines
2.6 KiB
Markdown
Raw Normal View History

2021-10-04 17:36:08 +08:00
# `tar-no-std` - Parse Tar Archives (Tarballs)
2021-10-04 17:00:03 +08:00
_Due to historical reasons, there are several formats of tar archives. All of them are based on the same principles,
but have some subtle differences that often make them incompatible with each other._ [[0]]
Library to read Tar archives (by GNU Tar) in `no_std` contexts with zero allocations. If you have a standard
environment and need full feature support, I recommend the use of <https://crates.io/crates/tar> instead.
2022-01-03 17:43:51 +08:00
## Limitations
2021-10-04 17:00:03 +08:00
The crate is simple and only supports reading of "basic" archives, therefore no extensions, such
as *GNU Longname*. The maximum supported file name length is 100 characters including the NULL-byte.
The maximum supported file size is 8GiB. Also, directories are not supported yet but only flat
collections of files.
2022-01-03 17:43:51 +08:00
## Use Case
2021-10-04 17:00:03 +08:00
This library is useful, if you write a kernel or a similar low-level application, which needs
"a bunch of files" from an archive ("init ramdisk"). The Tar file could for example come
as a Multiboot2 boot module provided by the bootloader.
This crate focuses on extracting files from uncompressed Tar archives created with default options by **GNU Tar**.
GNU Extensions such as sparse files, incremental archives, and long filename extension are not supported yet.
[This link](https://www.gnu.org/software/tar/manual/html_section/Formats.html) gives a good overview over possible
archive formats and their limitations.
## Example (without `alloc`-feature)
2021-10-04 17:31:11 +08:00
```rust
use tar_no_std::TarArchiveRef;
2021-10-04 17:31:11 +08:00
fn main() {
// log: not mandatory
std::env::set_var("RUST_LOG", "trace");
env_logger::init();
2021-10-04 18:25:53 +08:00
// also works in no_std environment (except the println!, of course)
2021-10-04 17:31:11 +08:00
let archive = include_bytes!("../tests/gnu_tar_default.tar");
let archive = TarArchiveRef::new(archive);
2021-10-04 18:25:53 +08:00
// Vec needs an allocator of course, but the library itself doesn't need one
2021-10-04 17:31:11 +08:00
let entries = archive.entries().collect::<Vec<_>>();
println!("{:#?}", entries);
2021-10-04 18:25:53 +08:00
println!("content of last file:");
println!("{:#?}", entries[2].data_as_str().expect("Invalid UTF-8") );
2021-10-04 17:31:11 +08:00
}
```
2021-10-04 17:00:03 +08:00
## Alloc Feature
2021-10-11 21:36:48 +08:00
This crate allows the usage of the additional Cargo build time feature `alloc`. When this is used,
the crate also provides the type `TarArchive`, which owns the data on the heap.
## Compression (`tar.gz`)
2021-10-04 19:46:03 +08:00
If your tar file is compressed, e.g. by `.tar.gz`/`gzip`, you need to uncompress the bytes first
2022-01-03 17:43:51 +08:00
(e.g. by a *gzip* library). Afterwards, this crate can read the Tar archive format from the uncompressed
bytes.
2021-10-04 17:00:03 +08:00
## MSRV
2022-05-03 03:59:51 +08:00
The MSRV is 1.52.1 stable.
2021-10-04 17:00:03 +08:00
[0]: https://www.gnu.org/software/tar/manual/html_section/Formats.html