JIRCStats

JIRCStats is a Java application that parses and analyses IRC log files and generates statistics from them. It is currently under development. You can see some sample output here, and a sample configuration here. At the time of writing, the sample configuration contains several sections that are ignored because their corresponding features aren't implemented yet.
Todo
Bolded items are the ones that really need to be done before the first release.
- Filter/limit options for user table
- Devise proper nick linking algorithm
- Way to template jobs in the config
- Nickname extractor
- Character counter
- (Better) error handling, rather than just allowing exceptions to be thrown if the user makes an error in the config
- Output or suggest a default config if one doesn't exist
- Time of date outputter labels setting should be configurable
- Small webpage describing project + links to examples
- Abstract time of date outputter to cope with different data sources (months, days, etc)
- Outputter for big numbers/awards
- Algorithm to read overlapping log files
- Read <format> section of config
- Custom parser based on format XML
- Make user stuff work properly per-job
- Abstract the bot stuff so it can all be customised
- Dynamic list of scripts to be included
- Extractor to round numbers
- Better split up outputter/resources/style outputter
- Way for user table things to go on to another row
- Way to remove duplicate lines
- Options for the quote context feature should be configurable
- User table should take outputters
- Possible bug with last seen times
- Make nick merging algorithm configurable
- Quote context shouldn't show nickname when not showing context
- Context should include lines immediately following a quote
- Context should show date/time
Nick merging algorithms
- Host matching - combination of ident, host and/or domain; optional minimum nick distance
- Nick matching - merge anyone who uses the same nickname for longer than a certain amount of time
- Nick prefixing - merge anyone whose nickname shares a prefix with someone else, with options for delimiters and distance
- Rejoin matching - merge anyone who joins within a certain period before/after a quit, with a minimum nick distance
- Manual - merge nicks according to configuration
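
As a rough illustration of how two of the rules above (nick prefixing and rejoin matching) might be expressed, here is a minimal Java sketch. It is not taken from the JIRCStats source: the class `NickMergeSketch`, its method names and all parameters are hypothetical. It also assumes that "nick distance" means Levenshtein edit distance and that the threshold is an upper bound on how different two nicks may be, neither of which is confirmed by the list above.

```java
import java.util.Locale;

/** Hypothetical sketch of two merge rules; not part of JIRCStats itself. */
final class NickMergeSketch {

    private NickMergeSketch() { }

    /** Levenshtein edit distance (assumed meaning of "nick distance"). */
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning "" into the first j chars of b takes j inserts
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning the first i chars of a into "" takes i deletes
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    /** Returns the part of the nick before the first delimiter character. */
    static String baseNick(String nick, String delimiters) {
        for (int i = 0; i < nick.length(); i++) {
            if (delimiters.indexOf(nick.charAt(i)) >= 0) {
                return nick.substring(0, i);
            }
        }
        return nick;
    }

    /** Nick prefixing: same non-empty base nick, ignoring case. */
    static boolean prefixMatch(String a, String b, String delimiters) {
        String baseA = baseNick(a, delimiters);
        String baseB = baseNick(b, delimiters);
        return !baseA.isEmpty() && baseA.equalsIgnoreCase(baseB);
    }

    /**
     * Rejoin matching: the join falls within windowMillis before or after
     * the quit, and the two nicks differ by at most maxDistance edits
     * (the direction of the threshold is an assumption).
     */
    static boolean rejoinMatch(String quitNick, long quitMillis,
                               String joinNick, long joinMillis,
                               long windowMillis, int maxDistance) {
        boolean inWindow = Math.abs(joinMillis - quitMillis) <= windowMillis;
        String q = quitNick.toLowerCase(Locale.ROOT);
        String j = joinNick.toLowerCase(Locale.ROOT);
        return inWindow && distance(q, j) <= maxDistance;
    }
}
```

With delimiters `|_`, `prefixMatch("Chris|away", "chris_", "|_")` treats both nicks as the base nick `chris`, whereas `chris` and `christine` are kept apart because neither contains a delimiter. Requiring a delimiter rather than accepting any shared prefix is a design choice in this sketch, made to avoid merging unrelated users whose names happen to start the same way.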
Recent changes
| Rev | Date | Author | Log |
|---|---|---|---|
| 57 | Sat 26th April 2008 | chris | Dynamic list of scripts to be included |
| 56 | Sat 26th April 2008 | chris | # Quote context shouldn't show nickname when not showing context |
| 55 | Sat 26th April 2008 | chris | Fix deliberate error in config |
| 54 | Sat 26th April 2008 | chris | Better error reporting |
| 53 | Tue 22nd April 2008 | chris | UTOD work; Apparently missing a bunch of source files |
| 52 | Mon 21st April 2008 | chris | Add jdom and apache commons lang; HTML escape everything that needs it |
| 51 | Mon 21st April 2008 | chris | Per-user time of date extractor |
| 50 | Sun 20th April 2008 | chris | Last seen extractor; Users still active at the end of the run are assumed to have quit after they were last active, rather than at the last log line |
| 49 | Sun 20th April 2008 | chris | Quote context stuff; Extractors can also be outputters |
| 48 | Sun 20th April 2008 | chris | Abstract extractors/analysers somewhat; Initial work on quote context |