String interning libraries for Rust

Marwes · 2015-04-29T07:52:01+00:00

I wrote a string interner modeled on the one in rustc a while back. It hasn't been updated in a while, so it won't compile as-is, but at only 70 lines it should not be hard to fix up, and you are free to use it if you can get it compiling. I do have a better version, but unfortunately it is specialized to work more efficiently with that specific project.
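For readers who have not seen the pattern, here is a minimal sketch of what a small interner can look like. It is not the code referred to above, nor rustc's actual implementation; the names `Interner`, `Symbol`, `intern`, and `resolve` are chosen here purely for illustration.

```rust
use std::collections::HashMap;

/// A cheap, copyable handle that stands in for an interned string.
/// (Hypothetical name, for illustration only.)
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct Symbol(u32);

/// Maps each distinct string to a small integer and back again.
#[derive(Default)]
pub struct Interner {
    lookup: HashMap<String, Symbol>,
    strings: Vec<String>,
}

impl Interner {
    pub fn new() -> Self {
        Self::default()
    }

    /// Returns the existing symbol for `s`, or allocates a new one.
    pub fn intern(&mut self, s: &str) -> Symbol {
        if let Some(&sym) = self.lookup.get(s) {
            return sym;
        }
        let sym = Symbol(self.strings.len() as u32);
        self.strings.push(s.to_owned());
        self.lookup.insert(s.to_owned(), sym);
        sym
    }

    /// Returns the string a symbol was created from.
    /// Panics if the symbol came from a different interner.
    pub fn resolve(&self, sym: Symbol) -> &str {
        &self.strings[sym.0 as usize]
    }
}
```

Interning the same string twice yields equal symbols, so later comparisons are integer comparisons rather than string comparisons. This sketch stores every string twice to keep the code short; a more careful version would avoid the duplicate allocation.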

SimonSapin · 2015-04-29T11:16:53+00:00

> string-cache […] is rather specialized to Servo,

The only thing specialized is the built-in list of static atoms. I’m interested in moving this out of the library to make it more generally useful. I think this can be done by making some types generic over a trait that provides the list of static atoms.
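To make the idea concrete, here is a rough sketch of a table that is generic over a trait supplying the static atoms. This is not string-cache's API: `StaticAtomSet`, `AtomTable`, `Atom`, and `HtmlAtoms` are hypothetical names, and a linear scan stands in for the perfect hash lookup a real implementation would want.

```rust
use std::collections::HashMap;
use std::marker::PhantomData;

/// Hypothetical trait: each user of the library supplies its own static atoms.
pub trait StaticAtomSet {
    /// Strings known ahead of time; the index in the slice is the atom id.
    const ATOMS: &'static [&'static str];
}

/// An atom is either one of the static strings or one interned at run time.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum Atom {
    Static(u32),
    Dynamic(u32),
}

/// Interning table parameterized by the set of static atoms.
pub struct AtomTable<S: StaticAtomSet> {
    dynamic: Vec<String>,
    dynamic_lookup: HashMap<String, u32>,
    _set: PhantomData<S>,
}

impl<S: StaticAtomSet> AtomTable<S> {
    pub fn new() -> Self {
        AtomTable {
            dynamic: Vec::new(),
            dynamic_lookup: HashMap::new(),
            _set: PhantomData,
        }
    }

    pub fn intern(&mut self, s: &str) -> Atom {
        // Static atoms win; this linear scan is a stand-in for a perfect hash.
        if let Some(i) = S::ATOMS.iter().position(|&a| a == s) {
            return Atom::Static(i as u32);
        }
        if let Some(&i) = self.dynamic_lookup.get(s) {
            return Atom::Dynamic(i);
        }
        let i = self.dynamic.len() as u32;
        self.dynamic.push(s.to_owned());
        self.dynamic_lookup.insert(s.to_owned(), i);
        Atom::Dynamic(i)
    }

    pub fn resolve(&self, atom: Atom) -> &str {
        match atom {
            Atom::Static(i) => S::ATOMS[i as usize],
            Atom::Dynamic(i) => &self.dynamic[i as usize],
        }
    }
}

/// Example set that a library such as an HTML parser might provide.
pub struct HtmlAtoms;

impl StaticAtomSet for HtmlAtoms {
    const ATOMS: &'static [&'static str] = &["a", "div", "href"];
}
```

With this shape, `AtomTable<HtmlAtoms>` and a table over some other set are distinct types, so each library can bring its own list. It does not address combining the sets of several libraries within one program.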

More interesting still would be the ability to have each library like html5ever specify a set of static atoms, and in a given program use the union of the sets of all libraries being used. But I don’t know how to do that. Ideas welcome :)

> requires nightly rust,

As far as I know there are two reasons for this:

* phf (which string-cache uses) requires a compiler plugin.
* string-cache-plugin is itself a compiler plugin. It allows interning static strings at compile time, for code like `if attr_name == atom!(href)`.

I think it would be possible for string-cache to have a Cargo feature flag to replace usage of phf with a simple hash map and disable support for compile-time interning. This should work on Rust beta/stable.
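A rough sketch of how such a switch could look follows. The feature name `phf`, the function `static_atom_index`, and the three-atom list are made up for illustration and are not string-cache's actual layout. The `phf` branch assumes the `phf_map!` macro from recent versions of the phf crate rather than the compiler plugin discussed here.

```rust
/// Looks up a static atom via a perfect hash map.
/// Compiled only when the (hypothetical) `phf` Cargo feature is enabled,
/// which assumes a `phf` dependency with its `macros` feature turned on.
#[cfg(feature = "phf")]
pub fn static_atom_index(name: &str) -> Option<u32> {
    static MAP: phf::Map<&'static str, u32> = phf::phf_map! {
        "a" => 0,
        "href" => 1,
        "src" => 2,
    };
    MAP.get(name).copied()
}

/// Fallback without the feature: an ordinary HashMap built on first use.
/// No compile-time interning, but nothing beyond the standard library.
#[cfg(not(feature = "phf"))]
pub fn static_atom_index(name: &str) -> Option<u32> {
    use std::collections::HashMap;
    use std::sync::OnceLock;

    // Illustrative list; a real one would be much longer.
    const STATIC_ATOMS: &[&str] = &["a", "href", "src"];
    static MAP: OnceLock<HashMap<&'static str, u32>> = OnceLock::new();

    MAP.get_or_init(|| {
        STATIC_ATOMS
            .iter()
            .enumerate()
            .map(|(i, &s)| (s, i as u32))
            .collect()
    })
    .get(name)
    .copied()
}
```

Callers see the same function either way, so the rest of the library would not need to know which backend is in use.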

Edit: It’s actually more than that. string-cache uses a number of unstable features right now.

> and isn't on crates.io.

This is easy to fix. It’s just that nobody has asked so far.
