Project Vyx Zwo
Welcome to Vyx Zwo. Everything on the outside is the same as the first project
(Vyx Eins), but the code has been completely redone. All of the computing work
is done on my home computer, and the results are posted here. I think the other
main difference is that I will count accented characters as seperate characters
until I verify they are merely different forms. Keeps me from having to recount
a language if I found out
ç really isn't
c.
Links of interest:
According to these standards:
- 3,013 characters per page (220,000 characters take up 73 pages)
- 500 pages per inch
Vyx Zwo has read 23,023 pages of material (3.84 feet!).
Welcome to Project Vyx. I thought it would be interesting to see
how often each letter of English occurs in everyday use. I then thought it
would be interesting to see how often each letter of German is used, which soon
extended to other Germanic languages. Now I've decided to count every language
I can get my hands on.
To give you a rough estimate of how much processing is going on behind the scenes,
image this: 220,000 characters take up 73 pages in Word, Which is 3,013 characters
a page. This translates to 5,000,000 characters taking up 1,690 pages. Your average
package of paper has 500 pages, so you'd need a stack of about 4 inches of paper
to print out everything this script reads. Per language.
The results posted here probably aren't "fair". That is, the text I count
is probably biased in some way. For example, a lot of my text comes from versions of
the Bible translated into various languages - that means the letters J, E, S, and U
may be inaccurately represented.
A Note on Help
Having people who know a language is really helpful when I'm trying to count it. Take
Arabic, for example - I know nothing about the Arabic script and need to find somebody
who does before I can even decide if it's countable by Vyx. For this reason I must
stick with languages that use a Latin-based alphabet for the time being.
The Data
These are all the languages I've counted in the order I counted them:
- English
- German
- Swedish
- Norwegian
- French
- Italian
- Hungarian
- Spanish
- Portuguese
- Danish *
- Turkish *
- Platt (Niederdeutsch) *
- Cebuano *
*: these languages aren't finished for one reason or another. I count a language finished once I have sent
5 million characters through the program.
Project Vyx results by Randi Hagen 2004-05 @ http://www.kriona.net/dml/vyx/