Character counts in tei_all.rng¹:
count | codepoint | character | character name |
---|---|---|---|
280087 | U+0020 | | SPACE |
37145 | U+0065 | e | LATIN SMALL LETTER E |
35923 | U+0074 | t | LATIN SMALL LETTER T |
25290 | U+0061 | a | LATIN SMALL LETTER A |
23178 | U+000A | | LINE FEED (LF) |
21630 | U+0069 | i | LATIN SMALL LETTER I |
19580 | U+006E | n | LATIN SMALL LETTER N |
18658 | U+006F | o | LATIN SMALL LETTER O |
17844 | U+0072 | r | LATIN SMALL LETTER R |
16199 | U+0073 | s | LATIN SMALL LETTER S |
13067 | U+006C | l | LATIN SMALL LETTER L |
11290 | U+0063 | c | LATIN SMALL LETTER C |
10798 | U+002E | . | FULL STOP or PERIOD |
9865 | U+0064 | d | LATIN SMALL LETTER D |
8056 | U+0075 | u | LATIN SMALL LETTER U |
8027 | U+0070 | p | LATIN SMALL LETTER P |
7989 | U+006D | m | LATIN SMALL LETTER M |
6992 | U+0068 | h | LATIN SMALL LETTER H |
6694 | U+0062 | b | LATIN SMALL LETTER B |
5324 | U+0067 | g | LATIN SMALL LETTER G |
4611 | U+0066 | f | LATIN SMALL LETTER F |
2910 | U+0079 | y | LATIN SMALL LETTER Y |
1821 | U+006B | k | LATIN SMALL LETTER K |
1671 | U+0076 | v | LATIN SMALL LETTER V |
1625 | U+002D | - | HYPHEN-MINUS |
1568 | U+0077 | w | LATIN SMALL LETTER W |
1358 | U+002C | , | COMMA |
1341 | U+005D | ] | RIGHT SQUARE BRACKET or CLOSING SQUARE BRACKET |
1334 | U+0028 | ( | LEFT PARENTHESIS or OPENING PARENTHESIS |
1334 | U+0029 | ) | RIGHT PARENTHESIS or CLOSING PARENTHESIS |
1205 | U+0078 | x | LATIN SMALL LETTER X |
1102 | U+0053 | S | LATIN CAPITAL LETTER S |
1100 | U+0031 | 1 | DIGIT ONE |
1052 | U+004C | L | LATIN CAPITAL LETTER L |
1015 | U+005B | [ | LEFT SQUARE BRACKET or OPENING SQUARE BRACKET |
770 | U+0032 | 2 | DIGIT TWO |
702 | U+005F | _ | LOW LINE or SPACING UNDERSCORE |
695 | U+0044 | D | LATIN CAPITAL LETTER D |
676 | U+0052 | R | LATIN CAPITAL LETTER R |
667 | U+003A | : | COLON |
662 | U+0033 | 3 | DIGIT THREE |
659 | U+005C | \ | REVERSE SOLIDUS or BACKSLASH |
652 | U+0071 | q | LATIN SMALL LETTER Q |
630 | U+0049 | I | LATIN CAPITAL LETTER I |
609 | U+0043 | C | LATIN CAPITAL LETTER C |
576 | U+0054 | T | LATIN CAPITAL LETTER T |
563 | U+0050 | P | LATIN CAPITAL LETTER P |
462 | U+002B | + | PLUS SIGN |
444 | U+0040 | @ | COMMERCIAL AT |
389 | U+004E | N | LATIN CAPITAL LETTER N |
353 | U+007B | { | LEFT CURLY BRACKET or OPENING CURLY BRACKET |
353 | U+007D | } | RIGHT CURLY BRACKET or CLOSING CURLY BRACKET |
348 | U+0045 | E | LATIN CAPITAL LETTER E |
341 | U+0034 | 4 | DIGIT FOUR |
318 | U+0030 | 0 | DIGIT ZERO |
315 | U+004F | O | LATIN CAPITAL LETTER O |
313 | U+003B | ; | SEMICOLON |
301 | U+007A | z | LATIN SMALL LETTER Z |
293 | U+002F | / | SOLIDUS or SLASH |
286 | U+0055 | U | LATIN CAPITAL LETTER U |
267 | U+006A | j | LATIN SMALL LETTER J |
259 | U+0047 | G | LATIN CAPITAL LETTER G |
250 | U+004D | M | LATIN CAPITAL LETTER M |
207 | U+0027 | ' | APOSTROPHE or APOSTROPHE-QUOTE |
200 | U+0036 | 6 | DIGIT SIX |
195 | U+0041 | A | LATIN CAPITAL LETTER A |
191 | U+0046 | F | LATIN CAPITAL LETTER F |
190 | U+0035 | 5 | DIGIT FIVE |
184 | U+005A | Z | LATIN CAPITAL LETTER Z |
177 | U+0037 | 7 | DIGIT SEVEN |
175 | U+0042 | B | LATIN CAPITAL LETTER B |
175 | U+005E | ^ | CIRCUMFLEX ACCENT or SPACING CIRCUMFLEX |
140 | U+0039 | 9 | DIGIT NINE |
129 | U+003E | > | GREATER-THAN SIGN |
125 | U+003C | < | LESS-THAN SIGN |
123 | U+002A | * | ASTERISK |
112 | U+0057 | W | LATIN CAPITAL LETTER W |
106 | U+0038 | 8 | DIGIT EIGHT |
90 | U+007C | \| | VERTICAL LINE or VERTICAL BAR |
86 | U+0056 | V | LATIN CAPITAL LETTER V |
84 | U+0058 | X | LATIN CAPITAL LETTER X |
67 | U+0048 | H | LATIN CAPITAL LETTER H |
66 | U+003F | ? | QUESTION MARK |
62 | U+004B | K | LATIN CAPITAL LETTER K |
54 | U+0022 | " | QUOTATION MARK |
41 | U+003D | = | EQUALS SIGN |
40 | U+0059 | Y | LATIN CAPITAL LETTER Y |
31 | U+0024 | $ | DOLLAR SIGN |
31 | U+004A | J | LATIN CAPITAL LETTER J |
18 | U+0051 | Q | LATIN CAPITAL LETTER Q |
12 | U+0060 | ` | GRAVE ACCENT or SPACING GRAVE |
4 | U+0025 | % | PERCENT SIGN |
4 | U+202F | | NARROW NO-BREAK SPACE |
4 | U+2070 | ⁰ | SUPERSCRIPT ZERO or SUPERSCRIPT DIGIT ZERO |
4 | U+00B3 | ³ | SUPERSCRIPT THREE or SUPERSCRIPT DIGIT THREE |
4 | U+2014 | — | EM DASH |
3 | U+0023 | # | NUMBER SIGN |
3 | U+00B9 | ¹ | SUPERSCRIPT ONE or SUPERSCRIPT DIGIT ONE |
2 | U+00C5 | Å | LATIN CAPITAL LETTER A WITH RING ABOVE or LATIN CAPITAL LETTER A RING |
2 | U+00E5 | å | LATIN SMALL LETTER A WITH RING ABOVE or LATIN SMALL LETTER A RING |
2 | U+00E9 | é | LATIN SMALL LETTER E WITH ACUTE or LATIN SMALL LETTER E ACUTE |
2 | U+03A9 | Ω | GREEK CAPITAL LETTER OMEGA |
2 | U+00B2 | ² | SUPERSCRIPT TWO or SUPERSCRIPT DIGIT TWO |
2 | U+00F6 | ö | LATIN SMALL LETTER O WITH DIAERESIS or LATIN SMALL LETTER O DIAERESIS |
1 | U+00A0 | | NO-BREAK SPACE or NON-BREAKING SPACE |
1 | U+00E6 | æ | LATIN SMALL LETTER AE or LATIN SMALL LETTER A E |
1 | U+2076 | ⁶ | SUPERSCRIPT SIX or SUPERSCRIPT DIGIT SIX |
1 | U+207B | ⁻ | SUPERSCRIPT MINUS or SUPERSCRIPT HYPHEN-MINUS |
Total characters counted: 624,989.
This table was generated at 2024-08-16T13:25:48.761708292-04:00.
¹ file:/home/syd/Downloads/tei_all.rng
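
A table of this kind could be produced with a short script. The following is a minimal Python sketch, not the tool that generated this page. The helper name `count_characters` is hypothetical, and breaking ties by code point is an assumption (the table above does not always order ties that way). Python's `unicodedata` module has no name for control characters such as U+000A and does not supply the alternative names shown above, so the name column may differ from the table.

```python
from collections import Counter
import unicodedata


def count_characters(text):
    """Return (count, codepoint, character, name) rows, most frequent first.

    Hypothetical helper for illustration. Ties are broken by code point,
    which is an assumption rather than the behaviour of the original tool.
    """
    counts = Counter(text)
    rows = []
    for ch, n in sorted(counts.items(), key=lambda kv: (-kv[1], ord(kv[0]))):
        codepoint = f"U+{ord(ch):04X}"
        # unicodedata.name() has no entry for control characters,
        # so fall back to an empty string instead of raising ValueError.
        name = unicodedata.name(ch, "")
        rows.append((n, codepoint, ch, name))
    return rows


# Small in-memory sample; a real run would read the file's text instead.
sample = "<element name=\"TEI\"/>\n"
rows = count_characters(sample)
for n, codepoint, ch, name in rows:
    # repr() keeps invisible characters such as LF and SPACE readable.
    print(f"{n} | {codepoint} | {ch!r} | {name} |")
print(f"Total characters counted: {sum(r[0] for r in rows):,}.")
```

Counting by character rather than by byte matters here: the file contains non-ASCII characters such as U+2014 and U+03A9, each of which occupies several bytes in UTF-8 but counts once in the table.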