From 386c0bcda597914671dfeb4e1391f41e71465e91 Mon Sep 17 00:00:00 2001 From: Boris Kolpackov Date: Mon, 9 Mar 2020 12:02:24 +0200 Subject: Document UTF-8 encoding for buildfiles and testscripts --- doc/testscript.cli | 4 ++++ 1 file changed, 4 insertions(+) (limited to 'doc/testscript.cli') diff --git a/doc/testscript.cli b/doc/testscript.cli index 5d50472..d39893a 100644 --- a/doc/testscript.cli +++ b/doc/testscript.cli @@ -1082,6 +1082,10 @@ failures (due to unexpected output) to go undetected. \h1#lexical|Lexical Structure| +At the lexical level, testscripts are UTF-8 encoded text restricted to the +Unicode graphic characters, tabs (\c{\\t}), carriage returns (\c{\\r}), and +line feeds (\c{\\n}). + Testscript is a line-oriented language with a context-dependent lexical structure. It \"borrows\" several building blocks (variable expansion, function calls, and evaluation contexts; collectively called \i{expansions} -- cgit v1.1