Tag: LegalTech
All the articles with the tag "LegalTech".
-
Building FineWeb-Legal: A 10B Token Pilot
Published:• 2 min readHow I extracted 67 million words of legal text from 10B tokens of web data using heuristics and classifiers.
All the articles with the tag "LegalTech".
How I extracted 67 million words of legal text from 10B tokens of web data using heuristics and classifiers.