In common parlance, “metadata” refers to “data which is easily neglected.” Its original and actual meaning, as used ten years ago, is “data about data.”
If you’re going to publish a document and wish to ensure that no data has been neglected from review (no data is “hiding”), then print your final version to PDF. Even if it is a PDF document that you are editing and have carefully redacted, print it to PDF. See Data Redaction: You’re Doing it Wrong (SANS). See Rick’s Guide to using Redaction in Acrobat X Pro (Adobe).
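As a rough illustration of how hidden data can be checked for before publishing, the sketch below scans the raw bytes of a PDF for the standard document information keys (Title, Author, Subject, Keywords, Creator, Producer). The function name `scan_pdf_info` is hypothetical, and the approach is deliberately crude: it only sees uncompressed literal strings, it will also report `/Title` entries from bookmarks, and an empty result does not prove a file is clean. It is a first-pass check under those assumptions, not a substitute for the redaction guides cited above.

```python
import re

# Standard keys of the PDF document information dictionary.
INFO_KEYS = (b"Title", b"Author", b"Subject", b"Keywords", b"Creator", b"Producer")

# Matches literal strings such as "/Author (Jane Doe)". Backslash escapes are
# tolerated; nested unescaped parentheses, hex strings and compressed
# object streams are NOT handled (assumption: a quick first-pass scan only).
_INFO_RE = re.compile(
    rb"/(" + b"|".join(INFO_KEYS) + rb")\s*\(((?:\\.|[^\\()])*)\)"
)


def scan_pdf_info(raw: bytes) -> list[tuple[str, str]]:
    """Return (key, value) pairs for information-style entries found in raw PDF bytes."""
    return [
        (key.decode("ascii"), value.decode("latin-1"))
        for key, value in _INFO_RE.findall(raw)
    ]
```

To try it on a real file, read the bytes first, for example `scan_pdf_info(open("report.pdf", "rb").read())`, where `report.pdf` is a placeholder name.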
What’s a Little Metadata Mining Between Colleagues? (Law Technology Review, no longer available)
Accidental publicist: Agencies risk unwitting release of sensitive information using popular office software (Federal Computer Week, no longer available)
Without a trace (Government Computer News (GCN) article about information leakage and the Bush administration)
How NOT to redact a PDF – Nuclear submarine secrets spilled (blog post at Sophos)
Production, Preservation and Disclosure of Metadata by J. Brian Beckham, Columbia Science and Technology Law Review (STLR)
A metadata manager should be familiar with the Data Management Association (DAMA) Data Management Body of Knowledge (DMBOK) and the ISO/IEC 11179 Metadata Registry (MDR) standard [pdf], as well as technologies and standards for data and document tagging (XML, XML Schema, RDF and other basic semantic web standards and tools).
Metadata Encoding and Transmission Standard (METS)
Phil Harvey’s ExifTool is a platform-independent Perl library plus a command-line application for reading, writing and editing meta information in a wide variety of files. ExifTool supports many different metadata formats including EXIF, GPS, IPTC, XMP, JFIF, GeoTIFF, ICC Profile, Photoshop IRB, FlashPix, AFCP and ID3, as well as the maker notes of many digital cameras by Canon, Casio, FujiFilm, HP, JVC/Victor, Kodak, Leaf, Minolta/Konica-Minolta, Nikon, Olympus/Epson, Panasonic/Leica, Pentax/Asahi, Ricoh, Sanyo, Sigma/Foveon and Sony.
metadataminer.com Metadata Miner Catalogue – List folders and documents summary information, extraction of file properties and IPTC metadata, manage, update information associated with MS Office, …
stellent.com Stellent’s Bitform Discover – metadata discovery
Microsoft Office 2003/XP: Remove Hidden Data Add-in With this add-in you can permanently remove hidden data and collaboration data, such as change tracking and comments, from Microsoft Word, Microsoft Excel, and Microsoft PowerPoint files.
Find and Remove Metadata (Hidden Information) in your Legal Documents. This Microsoft resource is helpful for anyone who uses Word, Excel and PowerPoint.
Redacting with Confidence: How to Safely Publish Sanitized Reports Converted from Word to PDF. [pdf] The National Security Agency guide should be mandatory reading for all government users who create PDF documents.
KKLsoftware.com ezClean – metadata discovery and removal tool
Workshare Trace: Workshare’s free tool for uncovering metadata and hidden data in Office documents.
SRS Technologies Document Detective. The software from SRS Technologies of Huntsville, Ala., reviews Microsoft Office documents and Adobe PDFs for hidden metadata and shows the results. A related product, the Electronic Document Review System, removes hidden and extraneous data.
Litera released Metadacte 2.0, allowing companies to prevent metadata leaks, including information sent from mobile devices.
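Of the tools listed above, ExifTool is the easiest to script, since it is a command-line application. The following is a minimal sketch of calling it from Python and reading its JSON output. It assumes the `exiftool` executable is installed and on the PATH; the helper names (`exiftool_command`, `parse_exiftool_json`, `read_metadata`) are illustrative and not part of ExifTool itself.

```python
import json
import subprocess


def exiftool_command(path: str) -> list[str]:
    # -json asks ExifTool for machine-readable output.
    # Assumption: the `exiftool` executable is available on PATH.
    return ["exiftool", "-json", path]


def parse_exiftool_json(output: str) -> dict:
    # ExifTool's JSON output is an array with one object per input file;
    # this sketch handles a single file and returns its tag dictionary.
    records = json.loads(output)
    return records[0] if records else {}


def read_metadata(path: str) -> dict:
    # Run ExifTool and return the tags it reports for one file.
    result = subprocess.run(
        exiftool_command(path), capture_output=True, text=True, check=True
    )
    return parse_exiftool_json(result.stdout)
```

Calling `read_metadata("photo.jpg")` (a placeholder file name) would return a dictionary of tag names and values, which can then be reviewed before a file is published.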