Download version of 29 Nov 2005
This program will look through all the files in a directory of your choice and produce a table that summarizes the metadata for those files, as shown in the following screen shot.
After you download the file, you may run it from the command line by typing:
java -jar MetadataAggregator.jar
To choose a directory to summarize, use the Open... option in the File menu.
You may click any column heading to sort on that column. You may add additional sort columns by CTRL-clicking those columns.
Normally, the program will not search subdirectories. You can set that option from the Options menu.
You may export the summarized data as a tab-delimited file by choosing Export... from the File menu.
If you just want to use the aggregator, you can stop reading now. If you want to change the code or add translations, keep reading.
If you want to provide translations of this program to other languages, edit the AggregatorBundle.properties file and save it with your language prefix and country. (For example, a Mexican Spanish translation would go into file AggregatorBundle_es_MX.properties.) Similarly, edit and save the helpfile.html file in the same manner, which would become helpfile_es_MX.properties.
In both cases, the country code and its leading underscore are optional. For generic Spanish, you would produce files AggregatorBundle_es.properties and helpfile_es.html. A (very) badly translated version is included in the download.
The aggregator doesn’t show all of the data it collects (and it
doesn’t collect absolutely everything in the metadata file either).
Here is a list of what the underlying libraries collect. Items in italics are
collected but not displayed. An item whose name begins with
meta: is an item specific to OpenDocument. An item whose internal
name begins with dc: is from the
Dublin Core Metadata Initiative
specification.
meta:generator (program that generated the document)dc:titledc:descriptiondc:subjectmeta:keywordmeta:initial-creatordc:creator (last person to edit the document)meta:printed-bymeta:creation-datedc:date (last date the document was edited)meta:print-datedc:language;meta:editing-cycles (number of times document was edited)meta:editing-duration (total time spent editing document)meta:page-countmeta:table-countmeta:image-countmeta:word-countmeta:character-countmeta:row-count (total number of table rows)meta:cell-count (total number of table cells)meta:draw-countmeta:ole-object-countmeta:paragraph-countmeta:frame-countmeta:sentence-countmeta:syllable-countmeta:non-whitespace-character-countmeta:object-count