How to deduplicate sentences in large .odt and .txt files?

Alas Poor Erinaceus@lemmy.ml · 3 days ago

adarza@piefed.ca · 3 days ago

it would compress well for an archival backup, so that’s what i’d do for the ‘originals’.

if your long message chains look anything like mine, most mails contain far more quoted material than new text, and not all of the new text is relevant to whatever is being saved… so it’d be quicker to do it the other way: copy and paste what you do want into new documents instead of trying to clean up these long compilations by deleting what you don’t want.
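
for anyone who still wants to try the automated route on the .txt files, here is a minimal sketch in Python, not a tested tool. it assumes the text is plain UTF-8, that quoted lines start with `>` markers, and that a naive split on `.`, `!` or `?` followed by whitespace is good enough (abbreviations like "e.g." will trip it up). the function names are made up for illustration, and an .odt file would have to be exported to plain text first.

```python
import re


def strip_quote_markers(text):
    # Remove leading "> > " style quote prefixes from every line.
    return re.sub(r"^[ \t]*(?:>[ \t]*)+", "", text, flags=re.MULTILINE)


def split_sentences(text):
    # Naive splitter: break after ., ! or ? when followed by whitespace.
    parts = re.split(r"(?<=[.!?])\s+", text)
    return [p.strip() for p in parts if p.strip()]


def normalize(sentence):
    # Comparison key: collapse whitespace and ignore case.
    return re.sub(r"\s+", " ", sentence).strip().casefold()


def dedupe_sentences(text):
    # Keep the first occurrence of each sentence, in original order.
    seen = set()
    kept = []
    for sentence in split_sentences(strip_quote_markers(text)):
        key = normalize(sentence)
        if key not in seen:
            seen.add(key)
            kept.append(sentence)
    return kept


if __name__ == "__main__":
    sample = (
        "Hello there. How are you?\n"
        "> Hello there.\n"
        "> How are you?\n"
        "Fine, thanks."
    )
    for line in dedupe_sentences(sample):
        print(line)
```

this keeps every seen sentence in memory, which should be fine for files of a few hundred MB at most. run it on a copy and keep the compressed originals, since the splitter will make mistakes.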