This is one aspect which I think has blown up the complexity of many tr-implementations around today. Instead of complicating the set-theory-based parser itself (he should still be relying on one rune per char, not multirunes), I added a preprocessor, which basically scans the code for upcoming '\'s, reads what he finds, substitutes the real character onto '\'s index and shifts the entire following array so there are no "holes". What is left to reflect on is what to do with octal sequences. I have a local implementation here, which works fine, but imho, given tr is already so focused on UTF-8, we might as well ignore POSIX at this point and rather implement the unicode UTF-8 code points, which are way more contemporary and future-proof. Reading in \uC3A4 as a an array of 0xC3 and 0xA4 is not the issue, but I'm still struggling to find a way to turn it into a well-formed byte sequence. Hit me with a mail if you have a simple solution for that.
sbase - suckless unix tools =========================== sbase is a collection of unix tools that are inherently portable across UNIX and UNIX-like systems. The following tools are implemented: UTILITY POSIX 2008 COMPLIANT MISSING OPTIONS ------- -------------------- --------------- basename cal cat yes none chgrp chmod chown chroot cksum cmp cols comm cp cron cut date dirname du echo env expand expr false fold grep head hostname kill yes none link ln logger yes logname ls md5sum mkdir mkfifo mktemp mv nice nl nohup paste printenv printf pwd readlink renice rm rmdir sleep setsid sort split sponge strings sync tail tar tee test touch tr true tty uudecode uuencode uname unexpand uniq unlink seq sha1sum sha256sum sha512sum wc xargs yes The complement of sbase is ubase[1] which is Linux-specific and provides all the non-portable tools. Together they are intended to form a base system similar to busybox but much smaller and suckless. Building -------- You need GNU make to build sbase on OpenBSD. To build sbase, simply type make. You may have to fiddle with config.mk depending on your system. You can also build sbase-box, which generates a single binary containing all the required tools. You can then symlink the individual tools to sbase-box. Ideally you will want to statically link sbase. If you are on Linux we recommend using musl-libc[2]. Portability ----------- sbase has been compiled on a variety of different operating systems, including Linux, *BSD, OSX, Haiku, Solaris, SCO OpenServer and others. Various combinations of operating systems and architectures have also been built. You can build sbase with gcc, clang, tcc, nwcc and pcc. [1] http://git.suckless.org/ubase/ [2] http://www.musl-libc.org/
Description
Languages
C
79.5%
Roff
16%
Shell
1.9%
Awk
1.3%
Makefile
1.3%