Avoiding the dangers of metadata

What is metadata?

Most software applications store information behind the scenes which you are not normally aware of. This information is used to run the application and provide the functionality that you see in day-to-day use.

This hidden information is generally called "metadata".

Although most applications store metadata to some extent, the main concern from the perspective of a lawyer is the wordprocessor; this being Microsoft Word in most modern law firms.

Problems arise when documents are sent outside of the firm by e-mail without the metadata being removed. Metadata that is not removed can be seen by the recipient of the document either in the normal course of working with the document or if the recipient looks specifically for hidden metadata.

Metadata can cause embarrassment and damage the reputation of the sender. The most notorious example of this is the Iraq Dossier published by the UK Government in February 2003 which was found to have been copied from an journal article posted by post-graduate student. Click here to read more about this.

Metadata Examples

Given that word processors represent the highest risk to lawyers in terms of metadata, and that Word for Windows is the most common word processor in use within law firms, we will look in a little more detail at the metadata which can be hidden within Word.

All versions of Word store metadata, although it has become easier to detect and remove in the most recent versions of the software.

Typical metadata found within a Word document may include:

  • Identification information
    • Your name and initials
    • Your company name
    • The name/number of your computer
    • The file/network location where you saved the document
  • The names of others who have worked on the document
  • Document revision history
  • Different versions of the same document
  • Hidden text
  • Comments e.g. drafting notes
  • Template information
  • Information that your document management system may insert e.g. original matter/client number and document categorisation information
  • Smart Tags

Threats

  • Compromised negotiating position

    If you send a document to the other side which contains information that you thought had been removed from the document but which can still be recovered, you may comprise your position or give away information unintentionally

  • Embarrassment and brand damage

    Sending out a document with damaging metadata can imply a lack of care in document preparation and a lack of technical competence. Depending on the information found within the metadata, the situation could be highly embarrassing

Opportunities

  • Gain negotiating advantage

    If you can discover what has been deleted from previous drafts or track the general history of changes to a document, you may discover information that you can use to your advantage.

    Many lawyers who purchased metadata discovery tools to protect their own outgoing documents are now using them to analyse and dissect incoming documents.

  • Challenge bills

    You may be able to see from the hidden information who has been working on a document. This can often show that a more junior member of staff than the bill would indicate, has been working on the document

  • Protect intellectual property

    How many lawyers take their precedents with them from firm to firm? The hidden information can often show where a document was originally created i.e. which firm, which user and which file name etc.

    If you have new lawyers joining your firm and bringing precedent documents with them, then you need to remove the metadata from these documents. Failure to do this means that your clients could see that you are sending out documents which were originally created in another firm.

Solutions

The most important thing is to understand the threat and ensure that anyone producing documents in the firm or sending documents outside of the firm understands the issues.

Good document design and good practice will avoid some of the problems.

One of the simplest things that can be done is to save all documents as PDF files before they are sent out of the firm.

However, to ensure that all metadata is removed from a document and to ensure that this is done at the correct time, most firms will need to use some specialist software designed for this task.

While it is possible to develop your own solution which removes some of the metadata from your documents, removing all of it is not so easy.

Some can be removed with a few lines of programming code in Word for example but other elements of it are much more difficult to remove and require an expert understanding of how Word stores information in documents and similarly export programming skills to remove it.

Most firms will therefore opt for a commercial product of which there are a number.

Please use our supplier directory to find a list of suppliers providing this type of software.

While all of these products remove some metadata, some remove more than others, some work better than others and some offer a range of additional features which may or may not be important to you.

SOLUTION FINDER

Solution
Firm Size
Country
 

LEGAL WEB SEARCH

  Add to Google