Say there is a 3 TB TXT file in which every line is a string. How would you find the duplicated strings in it? This is an interview question a friend of mine was asked. It's worth working questions like this out properly after an interview, in case they come up again in the next one.
PS: If I were the interviewee, I would ask the interviewer: why would you store that many strings in a single TXT file? It's really a bad idea!
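
For reference, one common way to attack the duplicate-finding question is hash partitioning: since 3 TB will not fit in memory, split the file into buckets on disk so that identical lines always land in the same bucket, then count duplicates one bucket at a time. Below is only a minimal sketch of that idea, not a definitive answer. The function name `find_duplicates`, the `num_buckets` parameter, and the use of CRC32 as the hash are my own assumptions, and it also assumes each bucket (and the final set of duplicates) fits in memory.

```python
import os
import tempfile
import zlib
from collections import Counter


def find_duplicates(path, num_buckets=256, tmp_dir=None):
    """Return the set of lines (as bytes) that occur more than once in `path`.

    Hypothetical helper: a two-pass, hash-partitioned scan.
    Assumes each bucket file fits in memory on its own.
    """
    duplicates = set()
    with tempfile.TemporaryDirectory(dir=tmp_dir) as work:
        bucket_paths = [
            os.path.join(work, f"bucket_{i}.txt") for i in range(num_buckets)
        ]
        buckets = [open(p, "wb") for p in bucket_paths]
        try:
            # Pass 1: route every line to a bucket chosen by its hash, so
            # equal lines always end up in the same bucket file.
            with open(path, "rb") as src:
                for raw in src:
                    line = raw.rstrip(b"\r\n")
                    idx = zlib.crc32(line) % num_buckets
                    buckets[idx].write(line + b"\n")
        finally:
            for b in buckets:
                b.close()

        # Pass 2: count lines within each bucket independently.
        for p in bucket_paths:
            counts = Counter()
            with open(p, "rb") as f:
                for raw in f:
                    counts[raw.rstrip(b"\n")] += 1
            duplicates.update(s for s, c in counts.items() if c > 1)
    return duplicates
```

With 256 buckets a 3 TB input still gives roughly 12 GB per bucket, so in practice you would need more buckets (watch the open-file limit), a second level of partitioning for oversized buckets, or an external sort instead. You would also want to write the duplicates to an output file rather than keep them all in a set.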