Entity finding in a document collection using adaptive window sizes