Character counts in tei_all.rng

Character counts in tei_all.rng¹, using the following parameters:

attrs
fold
skip
whitespace

Click on a column header to sort by that column.

count codepoint character character name
280087 U+0020 SPACE
37145 U+0065 e LATIN SMALL LETTER E
35923 U+0074 t LATIN SMALL LETTER T
25290 U+0061 a LATIN SMALL LETTER A
23178 U+000A LINE FEED (LF)
21630 U+0069 i LATIN SMALL LETTER I
19580 U+006E n LATIN SMALL LETTER N
18658 U+006F o LATIN SMALL LETTER O
17844 U+0072 r LATIN SMALL LETTER R
16199 U+0073 s LATIN SMALL LETTER S
13067 U+006C l LATIN SMALL LETTER L
11290 U+0063 c LATIN SMALL LETTER C
10798 U+002E . FULL STOP or PERIOD
9865 U+0064 d LATIN SMALL LETTER D
8056 U+0075 u LATIN SMALL LETTER U
8027 U+0070 p LATIN SMALL LETTER P
7989 U+006D m LATIN SMALL LETTER M
6992 U+0068 h LATIN SMALL LETTER H
6694 U+0062 b LATIN SMALL LETTER B
5324 U+0067 g LATIN SMALL LETTER G
4611 U+0066 f LATIN SMALL LETTER F
2910 U+0079 y LATIN SMALL LETTER Y
1821 U+006B k LATIN SMALL LETTER K
1671 U+0076 v LATIN SMALL LETTER V
1625 U+002D - HYPHEN-MINUS
1568 U+0077 w LATIN SMALL LETTER W
1358 U+002C , COMMA
1341 U+005D ] RIGHT SQUARE BRACKET or CLOSING SQUARE BRACKET
1334 U+0028 ( LEFT PARENTHESIS or OPENING PARENTHESIS
1334 U+0029 ) RIGHT PARENTHESIS or CLOSING PARENTHESIS
1205 U+0078 x LATIN SMALL LETTER X
1102 U+0053 S LATIN CAPITAL LETTER S
1100 U+0031 1 DIGIT ONE
1052 U+004C L LATIN CAPITAL LETTER L
1015 U+005B [ LEFT SQUARE BRACKET or OPENING SQUARE BRACKET
770 U+0032 2 DIGIT TWO
702 U+005F _ LOW LINE or SPACING UNDERSCORE
695 U+0044 D LATIN CAPITAL LETTER D
676 U+0052 R LATIN CAPITAL LETTER R
667 U+003A : COLON
662 U+0033 3 DIGIT THREE
659 U+005C \ REVERSE SOLIDUS or BACKSLASH
652 U+0071 q LATIN SMALL LETTER Q
630 U+0049 I LATIN CAPITAL LETTER I
609 U+0043 C LATIN CAPITAL LETTER C
576 U+0054 T LATIN CAPITAL LETTER T
563 U+0050 P LATIN CAPITAL LETTER P
462 U+002B + PLUS SIGN
444 U+0040 @ COMMERCIAL AT
389 U+004E N LATIN CAPITAL LETTER N
353 U+007B { LEFT CURLY BRACKET or OPENING CURLY BRACKET
353 U+007D } RIGHT CURLY BRACKET or CLOSING CURLY BRACKET
348 U+0045 E LATIN CAPITAL LETTER E
341 U+0034 4 DIGIT FOUR
318 U+0030 0 DIGIT ZERO
315 U+004F O LATIN CAPITAL LETTER O
313 U+003B ; SEMICOLON
301 U+007A z LATIN SMALL LETTER Z
293 U+002F / SOLIDUS or SLASH
286 U+0055 U LATIN CAPITAL LETTER U
267 U+006A j LATIN SMALL LETTER J
259 U+0047 G LATIN CAPITAL LETTER G
250 U+004D M LATIN CAPITAL LETTER M
207 U+0027 ' APOSTROPHE or APOSTROPHE-QUOTE
200 U+0036 6 DIGIT SIX
195 U+0041 A LATIN CAPITAL LETTER A
191 U+0046 F LATIN CAPITAL LETTER F
190 U+0035 5 DIGIT FIVE
184 U+005A Z LATIN CAPITAL LETTER Z
177 U+0037 7 DIGIT SEVEN
175 U+0042 B LATIN CAPITAL LETTER B
175 U+005E ^ CIRCUMFLEX ACCENT or SPACING CIRCUMFLEX
140 U+0039 9 DIGIT NINE
129 U+003E > GREATER-THAN SIGN
125 U+003C < LESS-THAN SIGN
123 U+002A * ASTERISK
112 U+0057 W LATIN CAPITAL LETTER W
106 U+0038 8 DIGIT EIGHT
90 U+007C | VERTICAL LINE or VERTICAL BAR
86 U+0056 V LATIN CAPITAL LETTER V
84 U+0058 X LATIN CAPITAL LETTER X
67 U+0048 H LATIN CAPITAL LETTER H
66 U+003F ? QUESTION MARK
62 U+004B K LATIN CAPITAL LETTER K
54 U+0022 " QUOTATION MARK
41 U+003D = EQUALS SIGN
40 U+0059 Y LATIN CAPITAL LETTER Y
31 U+0024 $ DOLLAR SIGN
31 U+004A J LATIN CAPITAL LETTER J
18 U+0051 Q LATIN CAPITAL LETTER Q
12 U+0060 ` GRAVE ACCENT or SPACING GRAVE
4 U+0025 % PERCENT SIGN
4 U+202F NARROW NO-BREAK SPACE
4 U+2070 SUPERSCRIPT ZERO or SUPERSCRIPT DIGIT ZERO
4 U+00B3 ³ SUPERSCRIPT THREE or SUPERSCRIPT DIGIT THREE
4 U+2014 EM DASH
3 U+0023 # NUMBER SIGN
3 U+00B9 ¹ SUPERSCRIPT ONE or SUPERSCRIPT DIGIT ONE
2 U+00C5 Å LATIN CAPITAL LETTER A WITH RING ABOVE or LATIN CAPITAL LETTER A RING
2 U+00E5 å LATIN SMALL LETTER A WITH RING ABOVE or LATIN SMALL LETTER A RING
2 U+00E9 é LATIN SMALL LETTER E WITH ACUTE or LATIN SMALL LETTER E ACUTE
2 U+03A9 Ω GREEK CAPITAL LETTER OMEGA
2 U+00B2 ² SUPERSCRIPT TWO or SUPERSCRIPT DIGIT TWO
2 U+00F6 ö LATIN SMALL LETTER O WITH DIAERESIS or LATIN SMALL LETTER O DIAERESIS
1 U+00A0   NO-BREAK SPACE or NON-BREAKING SPACE
1 U+00E6 æ LATIN SMALL LETTER AE or LATIN SMALL LETTER A E
1 U+2076 SUPERSCRIPT SIX or SUPERSCRIPT DIGIT SIX
1 U+207B SUPERSCRIPT MINUS or SUPERSCRIPT HYPHEN-MINUS

Total characters counted: 624,989.

This table generated 2024-08-16T13:25:48.761708292-04:00.


¹ file:/home/syd/Downloads/tei_all.rng