Domain Name - Statistics


conica
13. 06. 2006., 12:13
Since web statistics are an area I find interesting anyway, I decided to start a topic where I would share some statistics that someone might find useful. Here are some facts and statistics about domain names:


number of characters - .COM domains must now have a minimum of 3 characters. Of the 17,576 possible three-letter combinations, all are already taken.
Using only 5 letters in the domain name gives 11,881,376 possibilities, of which somewhat more than 11,000,000 are free.
Status:
2 letters - 100% taken
3 letters - 100% taken
4 letters - 79% taken
5 letters - 9% taken
domain name length - as already mentioned, 100% of two- and three-letter domains are already taken. The most common length is 11 characters (about 4,100,000 domains); there are slightly fewer than 100,000 domains that are 31 characters long, and about 600 domains of the maximum length, 63 characters.
The rise in the number of domains from 2 to 11 characters (where the maximum is) is twice as steep as the decline from 11 to 31 characters.
There are about 25,000 domains of length 36, about 20,000 of length 37, about 10,000 of length 40, and about 2,500 of length 49.
personal names - if you are interested in male first names, you will be disappointed to learn that 100% of the domains for male names (as listed by the US Census Bureau) are taken. As for female names, 2 are free (or probably WERE free :)). 100% of the most common surnames (out of the top 10,000) are taken as well.
i love... - 70% of the domains that start with the prefix "ILOVE" followed by a female name are free, while only 54% of those with "ILOVE" followed by a male name are free (I would say women express their feelings the IT way much more often :)).
About 270,000 domains contain the word "SEX", while only about 145,000 contain the word "LOVE".
initial letters and digits - The letter that starts the largest number of domains is "S" (about 4,600,000), followed by "C" (about 3,900,000) and "A" (about 3,400,000). The fewest domains start with the letter "Q" (about 250,000).
As for digits, the most common initial digit is "1" (about 270,000), followed by "2" (about 155,000) and "4" (about 120,000). The fewest domains start with the digits "6" and "0" (about 45,000).
suffixes - the largest number of domains have the word "GOOGLE" as a suffix (about 8,000), followed by "YAHOO" (about 7,100), "MICROSOFT" (about 3,100) and "SLASHDOT" (about 100) - examples: google-america, microsoft-ebooks, slashdotslash, etc.
syllables - as for initial syllables, "THE" leads by a wide margin with about 800,000 domains, while right behind it is "MAR" with only about 250,000.
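
For anyone who wants to check the combination counts quoted above, here is a minimal sketch. It assumes names made only of the 26 letters a-z (no digits or hyphens); the function name `letter_combinations` is made up for this illustration.

```python
def letter_combinations(length: int, alphabet_size: int = 26) -> int:
    """Number of distinct names of the given length over a fixed alphabet.

    Assumption: only the letters a-z are counted, so alphabet_size is 26.
    """
    return alphabet_size ** length


# Print the size of the name space for 2- to 5-letter names.
for n in range(2, 6):
    print(n, letter_combinations(n))
```

With these assumptions, 3 letters give 17,576 names and 5 letters give 11,881,376, which matches the figures in the post.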


That's all for now
cheers
Cony

bluesman
13. 06. 2006., 12:39
Thanks, very nice. What is the source of this information?

noviKorisnik
13. 06. 2006., 12:51
I like it... the choice of a name is worth thinking about... the other day I watched a member of this forum clutch his head because his domain name starts with D (... Firefox offers its autocomplete suggestions sorted by frequency of use) :-D

conica
13. 06. 2006., 12:52
The report was written by Dennis Forbes.

Introductory text:

I recently had a need for a mid-sized amount of real-world data, which I required for testing purposes on low-end hardware (testing and demonstrating some of the new functionality of SQL Server 2005). I wanted something that wasn't confidential, which excluded the easy choice of using business data, and I refrain from using artificial data. Around the same time I happened across the requisition process for the .COM/.NET and .EDU TLD zones, so I made a request for access.

Soon enough I had the 3.5GB of .COM domain names, along with 650MB of .NET, loaded into the database (although for all results in this entry I only included the .COM TLD, for the data as of 2pm on March 28th, 2006. I'll analyze the other ones at a future date). It was a great foundation for a lot of tests and demonstrations, and served my original goal admirably. I didn't stop there, however; curiosity led me to do some basic analysis to see what sorts of domain names are registered, and how saturated the registry really is.

Note that these are the Verisign distributed zone files, and do not include entries that have no nameservers configured, or which are in a hold state. While those comprise a very small minority of domain names, it does skew the results a bit. To improve accuracy when the sample set is small, for some of the tests I have validated the positives using the WHOIS infrastructure (for instance the domain file had several two letter sequences as being "available", and a dozen three letter sequences. All of them were the result of a hold state, or no nameservers configured). For aggregate results where it was inapplicable, I've filtered international domain names (IDN) from the results (prefaced with xn--).
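
The filtering and tallying described in the intro could look roughly like the sketch below. This is not Forbes's actual SQL Server code: the function names and the `sample` list are hypothetical, and the names are assumed to be plain strings without the ".com" part.

```python
from collections import Counter

IDN_PREFIX = "xn--"  # internationalized names appear in ASCII form with this prefix


def without_idn(names):
    """Drop internationalized domain names (those starting with 'xn--')."""
    return [n for n in names if not n.lower().startswith(IDN_PREFIX)]


def length_histogram(names):
    """Count how many names there are for each name length."""
    return Counter(len(n) for n in names)


def first_char_counts(names):
    """Count names by their first character, case-insensitively."""
    return Counter(n[0].lower() for n in names if n)


# Hypothetical sample, NOT real zone-file data.
sample = ["google-america", "microsoft-ebooks", "slashdotslash", "xn--example"]

names = without_idn(sample)
print(length_histogram(names).most_common())
print(first_char_counts(names).most_common())
```

On the real zone file the same idea would have to stream the names rather than hold them all in a list, since the .COM file alone is several gigabytes.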

noviKorisnik
13. 06. 2006., 13:07
Aha, here we go... so it's this one then - http://www.yafla.com/dforbes/2006/03/29.html

conica
13. 06. 2006., 13:17
Yes, that's it :)
I had it saved as a document, so I didn't have a link to post.

noviKorisnik
14. 06. 2006., 10:07
This is sooo good!
(yet another five-letter acronym? YAFLA?)