Only 78 entries about hate:
Talking about foxes spiked at about 10 mentions a month in late 2005:
The 343 posts about McLeod Residence had a good run, but they're now trailing off:
Looks like I've been talking more and more about being drunk over the last couple years:
Could be because of my growing appreciation of champagne:
Here's when I met Kellianne:
I'm an early adopter of Flickr:
And Twitter:
And the iPhone:
But my paranoia about asteroids and super volcanoes seems to have passed for the most part:
Adding a Lucene search engine to all 12,388 entries from the last 10 years is pretty interesting. Mixing the data with Google's chart API makes it even more interesting (to me at least). Now I just have to create more content for it to eat up.
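For anyone curious how the numbers behind charts like these could be produced, here's a rough sketch. It is not the actual setup: a plain substring match stands in for the Lucene query, and the `entries` list, `monthly_mentions`, and `as_series` are all made up for illustration. The idea is just to count matching entries per month and fill in zeros for the quiet months so the series can be handed to a charting tool.

```python
from collections import Counter
from datetime import date


def monthly_mentions(entries, term):
    """Count entries per (year, month) whose text contains `term`.

    Assumption: a case-insensitive substring match stands in for a real
    search-engine query.
    """
    counts = Counter()
    needle = term.lower()
    for posted, text in entries:
        if needle in text.lower():
            counts[(posted.year, posted.month)] += 1
    return counts


def as_series(counts, start, end):
    """Expand sparse counts into a dense month-by-month list, zeros included."""
    series = []
    year, month = start
    while (year, month) <= end:
        series.append(counts.get((year, month), 0))
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return series


# Hypothetical sample data standing in for the real archive.
entries = [
    (date(2005, 10, 3), "Saw a fox on the way home."),
    (date(2005, 11, 12), "More foxes. Fox fox fox."),
    (date(2005, 11, 20), "No animals today."),
    (date(2006, 1, 5), "The fox is back."),
]

counts = monthly_mentions(entries, "fox")
series = as_series(counts, (2005, 10), (2006, 1))
print(series)
```

Each entry counts once no matter how many times the word shows up in it, which matches "entries about X" rather than raw word frequency.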
Suggest other ways to mess with all of this.

scatterplot?
2009-04-23 05:55 pm (UTC)
Re: scatterplot?
2009-04-23 05:58 pm (UTC)
Re: scatterplot?
2009-04-23 06:22 pm (UTC)
http://code.google.com/apis/chart/t
You can also do a linear least squares or something and calculate correlation coefficients to get a number for how correlated they are, but if there's anything really interesting you'll probably see it right away on the graph.
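A minimal sketch of that calculation, assuming you already have two monthly series of equal length (the `drunk` and `champagne` numbers below are invented, not real data). It computes the Pearson correlation coefficient and the least-squares line using only the standard library.

```python
from math import sqrt


def _sums(xs, ys):
    """Return means and centered sums of squares and cross-products."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two series of equal length, at least 2 points")
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return mx, my, sxx, syy, sxy


def pearson(xs, ys):
    """Pearson correlation coefficient, between -1 and 1."""
    _, _, sxx, syy, sxy = _sums(xs, ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / sqrt(sxx * syy)


def least_squares(xs, ys):
    """Slope and intercept of the best-fit line y = slope * x + intercept."""
    mx, my, sxx, _, sxy = _sums(xs, ys)
    if sxx == 0:
        raise ValueError("slope is undefined when every x is the same")
    slope = sxy / sxx
    return slope, my - slope * mx


# Invented monthly mention counts, purely for illustration.
drunk = [1, 2, 3, 4, 5]
champagne = [2, 4, 6, 8, 10]

r = pearson(drunk, champagne)
slope, intercept = least_squares(drunk, champagne)
print(r, slope, intercept)
```

A value of r near 1 or -1 means the two series move together or in opposition; near 0 means no linear relationship, though the scatterplot can still show patterns the number misses.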
Re: scatterplot?
2009-04-23 06:01 pm (UTC)
http://en.wikipedia.org/wiki/Scatter_pl
2009-04-23 06:29 pm (UTC)
2009-04-23 06:39 pm (UTC)
2009-04-23 06:45 pm (UTC)
http://www.elsewhere.org/pomo/
2009-04-23 06:38 pm (UTC)
http://www.joesniff.co.uk/projects/late
What this method can do is find collections of words that tend to be found together... it assumes that there are some innate relationships between words that are found together in the same documents / paragraphs / sentences / whatever.
But this might be TOO complex...
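A much simpler stand-in for the idea, not latent semantic analysis itself: just count how often pairs of words show up in the same entry. Full LSA goes further, building a term-document matrix and reducing it with a singular value decomposition. The sample `docs`, the `cooccurrences` function, and the `min_len` cutoff for skipping short words are all assumptions for illustration.

```python
import re
from collections import Counter
from itertools import combinations


def cooccurrences(documents, min_len=4):
    """Count, for each pair of words, how many documents contain both.

    Words shorter than `min_len` are skipped as a crude stop-word filter.
    """
    pairs = Counter()
    for doc in documents:
        words = {
            w for w in re.findall(r"[a-z']+", doc.lower())
            if len(w) >= min_len
        }
        # Sorting makes each pair appear in one canonical order.
        for a, b in combinations(sorted(words), 2):
            pairs[(a, b)] += 1
    return pairs


# Hypothetical entries, not taken from the real archive.
docs = [
    "Champagne at McLeod tonight",
    "Too much champagne, now drunk",
    "Drunk on champagne again",
]

pairs = cooccurrences(docs)
for pair, count in pairs.most_common(3):
    print(pair, count)
```

The number of pairs grows quadratically with the number of distinct words per entry, so on a large archive it would make sense to restrict this to a chosen list of terms.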
2009-04-23 06:41 pm (UTC)
2009-04-23 06:44 pm (UTC)
2009-04-23 06:57 pm (UTC)
I should also see how many words I've used in 10 years... probably a good approximation of the size of my vocabulary.
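Something like this would do it, as a rough sketch: split every entry into lowercase words and collect them in a set. The `texts` list and the `vocabulary` function are hypothetical, and the tokenizer is deliberately naive, so "fox" and "foxes" count as two different words and the result overstates the real vocabulary somewhat.

```python
import re


def vocabulary(texts):
    """Return (total word count, set of distinct lowercase words)."""
    words = set()
    total = 0
    for text in texts:
        tokens = re.findall(r"[a-z']+", text.lower())
        total += len(tokens)
        words.update(tokens)
    return total, words


# Hypothetical entries standing in for the real archive.
texts = [
    "The fox is back.",
    "The fox drinks champagne.",
]

total, distinct = vocabulary(texts)
print(total, len(distinct))
```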
2009-04-23 07:00 pm (UTC)
2009-04-23 07:02 pm (UTC)
2009-04-23 07:09 pm (UTC)
And think of it less as "working for me" and more as an incidental way of gaining a little academic glory as a useful byproduct. :) Nah... I'm thinking more that if there are some good tools out there, just let me know. It's a topic that I've been interested in for a while.
2009-04-23 07:11 pm (UTC)
Okay, I'll let you know what I find. The field of latent semantic indexing is pretty established though... are you just looking for something really simple that lets you pass text into it and get results out of it? Or does it have to work with a particular file format?
2009-04-23 07:22 pm (UTC)
LSA/LSI seems pretty well-established in the information technology field, but I have yet to find it applied to the field I'm examining, which means that it would make for a great new contribution! Not a bad way to get published.
There are some commercially available packages, but I don't think that they would be able to do some of the things that I'd like to do with it... in particular, it would be great to get my hands on Python code (or Ruby code, depending on how quickly I could pick it up) that I could change around myself. On the other hand, it would be great to have code that I could use out of the box quickly without modification... no need to reinvent the wheel to answer very simple questions.
Not to mention that I'm quite cheap when it comes to things that I might not use too often for research. I'd rather use Python modules that I can play with for free than buy a non-customizable text mining program for several hundred dollars that might only get used a few times.
2009-04-23 06:43 pm (UTC)
2009-04-23 06:51 pm (UTC)
more pretty charts
2009-04-23 09:24 pm (UTC)
Re: more pretty charts
2009-04-23 09:53 pm (UTC)
2009-04-24 03:30 pm (UTC)
how much skill does that require of someone, to make those charts?
2009-04-25 06:33 pm (UTC)
http://code.google.com/apis/chart/