We should be able to create lists to match against for: * large blogs (techcrunch, healthline, etc) * news sites (cnn, reuters, etc) * PR release sites * etc