{"id":800,"date":"2003-04-13T20:28:00","date_gmt":"2003-04-14T05:28:00","guid":{"rendered":"http:\/\/www.cloudidentity.com\/blog\/2003\/04\/13\/word-count-of-dnwl\/"},"modified":"2013-03-16T13:02:42","modified_gmt":"2013-03-16T22:02:42","slug":"5548","status":"publish","type":"post","link":"https:\/\/www.cloudidentity.com\/blog\/2003\/04\/13\/5548\/","title":{"rendered":"Word count of DNWL"},"content":{"rendered":"<p><P>I thought it could have been interesting to see the word occurrencies in the various blogs of DNWL, just to seek further confirmation of the geeky attitude of the community \ud83d\ude42<\/P><br \/>\n<P>So I just implemented a quick DictionaryTree and I scanned the SpecialFolder26SharpReaderCache (leaving in it DNWL files only). For the sake of simplicity, I took in consideration only Title and Description InnerTexts and I stripped all HTML tags.<\/P><br \/>\n<P>The results are interesting, and definitely funny. You can find the complete list of terms plus occurrencies (over 14.500 terms, including some aberration in the end produced by the homebrewed parser) <A href=\"http:\/\/dotnetweblogs.com\/vbertocci\/Story\/5547.aspx\">here<\/A>.<\/P><br \/>\n<P>Fun facts:<\/P><br \/>\n<P>1) the term <FONT color=\"red\">NET<\/FONT> occurs 1700 times, between two powerful buzz words (ON and MY)<BR>2) <FONT color=\"red\">CODE<\/FONT> and <FONT color=\"red\">WEB<\/FONT> counts 561 and 535 respectively, between THEY and WOULD<BR>3) <FONT color=\"red\">DON<\/FONT>, not surprisingly, is the most used name with 454 occurrences: and I have the slight impression that that doesn&#8217;t match with the stats about the most common English name \ud83d\ude42<BR>4) the term <FONT color=\"red\">BLOGS<\/FONT> pops up 444 times<BR>5) <FONT color=\"red\">MICROSOFT<\/FONT> wins the most referenced company contest, with 369;<BR>6) <FONT color=\"red\">C#<\/FONT> is the most quoted language, with 273 entries (between 2003 and SHOULD)<BR>7) <FONT color=\"red\">SCOTT<\/FONT> seems another extremely common name, with 168 entries<\/P><br \/>\n<P>OK, ok. The system I used is FAR from being perfect, I should have preserved the &#8220;.&#8221; in front of &#8220;NET&#8221; and so on, but since the frequencies of the &#8220;normal&#8221; words are meaningful I believe that the analisys has some significance. In few weeks I&#8217;ll repeat the process, just to see if it will give insight on trends.<\/P><br \/>\n<P>&nbsp;<\/P><br \/>\n<P><A href=\"http:\/\/dotnetweblogs.com\/vbertocci\/Story\/5547.aspx\"><\/A>&nbsp;<\/P><\/p>\n<div style=\"clear:both\"><\/div>\n","protected":false},"excerpt":{"rendered":"<p>I thought it could have been interesting to see the word occurrencies in the various blogs of DNWL, just to seek further confirmation of the geeky attitude of the community \ud83d\ude42 So I just implemented a quick DictionaryTree and I scanned the SpecialFolder26SharpReaderCache (leaving in it DNWL files only). For the sake of&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"footnotes":""},"categories":[35,60],"tags":[],"class_list":["post-800","post","type-post","status-publish","format-standard","hentry","category-useless","category-wild-ideas"],"_links":{"self":[{"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/posts\/800","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/comments?post=800"}],"version-history":[{"count":3,"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/posts\/800\/revisions"}],"predecessor-version":[{"id":1951,"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/posts\/800\/revisions\/1951"}],"wp:attachment":[{"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/media?parent=800"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/categories?post=800"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.cloudidentity.com\/blog\/wp-json\/wp\/v2\/tags?post=800"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}