OK. I have to say Lightroom wouldn't be my first choice of management software but it's what my client wants to use and I'm about at my wits' end. Here's the situation:
I already made a catalogue of 10 of the 13 years with the recommended Master Folder/Year/YYYY-MM-DD structure and the photographer asked me to re-name files in a Date+Capture Time format. Unfortunately, some of the drives were missed and (even more unfortunately) LR's renaming proved very shaky - in particular in identifying duplicates and also where images were shot in burst mode as it appears not to read data in fractions of a second. As a result, I ended up with having to identify and remove some 50,000 duplicates but in retrospect should really have started over and kept the original file names. (Why, oh why doesn't LR store the original camera filename somewhere in the Metadata and offer a simple "revert and check for duplicates" option?) As you can imagine this has taken a huge amount of time as moving all the data around at USB2 speeds is pedestrian.
I can also tell anyone who's been wondering that catalogue sizes of 200,000 images plus are slow, difficult to manage and I suspect unreliable with supposedly simple tasks like checking catalogue integrity taking a very long time. Files often show as missing even though they are in the correct place.
Nearly finished....
So now I have a 2.6Tb catalog of part of the archive, some of the files in which have been re-named and around 12Tb of data which almost certainly contains some valuable new images that need to be added along with a bunch of dupes that don't. Starting over seems attractive as I might face less dupe issues but the downside is that I don't have an empty 3Tb drive handy - there's always something! Your thoughts, then:
- 29 external drives from 250Gb to 3Tb - 4 are USB3 and the rest USB2 and/or Firewire
- 5 formatted on a Mac, 23 are NTFS (PC) format and one with The Click Of Death which is probably headed for the bin.
- About 9Tb of (irreplaceable) files: JPG; CR2; Mov; and various other video codecs. Mostly still images - somewhere between 400 and 500k to date.
- The image and video files are duplicated at random over 24 of the drives with originals, backups and backups of backups
- Some have been renamed in a previous attempt at cataloguing (using Aperture, I think) and there are also many dupe file names since for a few years the author shot RAW+JPG
- My gear: i7 Pc with 12Gb RAM. Win7 x64
I already made a catalogue of 10 of the 13 years with the recommended Master Folder/Year/YYYY-MM-DD structure and the photographer asked me to re-name files in a Date+Capture Time format. Unfortunately, some of the drives were missed and (even more unfortunately) LR's renaming proved very shaky - in particular in identifying duplicates and also where images were shot in burst mode as it appears not to read data in fractions of a second. As a result, I ended up with having to identify and remove some 50,000 duplicates but in retrospect should really have started over and kept the original file names. (Why, oh why doesn't LR store the original camera filename somewhere in the Metadata and offer a simple "revert and check for duplicates" option?) As you can imagine this has taken a huge amount of time as moving all the data around at USB2 speeds is pedestrian.
I can also tell anyone who's been wondering that catalogue sizes of 200,000 images plus are slow, difficult to manage and I suspect unreliable with supposedly simple tasks like checking catalogue integrity taking a very long time. Files often show as missing even though they are in the correct place.
Nearly finished....
So now I have a 2.6Tb catalog of part of the archive, some of the files in which have been re-named and around 12Tb of data which almost certainly contains some valuable new images that need to be added along with a bunch of dupes that don't. Starting over seems attractive as I might face less dupe issues but the downside is that I don't have an empty 3Tb drive handy - there's always something! Your thoughts, then:
- Have I missed some ability in LR to revert to the in-camera filename?
- Start over? If so, what size of catalogue(s) to choose this time?
- Is there a way to avoid importing the JPG files that were shot along with the RAW, given that there are often RAW+JPG pairs, RAW only and JPG only in the same folder? I know you can stack the thumbnails but also want to avoid having LR count the pair as one because it makes a rough check of the catalogue by comparing stats from Windows or FileTree impossible.
- I'd prefer to avoid sidecar files as my client's computer attention span is limited and I'm afraid they will get detached before long. Embed Metadata is an OK option (I see posts here suggesting sidecar in response to someone else in a mess)?
- Optimum catalogue settings advice from anyone with a similar sized archive would be appreciated.
- Any other useful tips?
Last edited: