NVIDIA allegedly contacted Anna's Archive directly for access to ~500 terabytes of "pirated" books and papers for pre-training their LLMs Anna's warned them the collections were illegal and copyrighted. NVIDIA's data strategy team pushed anyway; executives gave the green light within days, per internal docs cited in the lawsuit.