RLee
About
Ryan Lee
Categories
All
(2)
benchmarks
(1)
extraction
(2)
llm
(2)
web-extraction
(2)
⚠️ I am in the progress of migrating my website. Thank you for your patience!
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction
extraction
web-extraction
llm
benchmarks
Web data record extraction is a problem of extracting repeated sets of semantically related elements from web pages. Effective evaluation of web data record extraction is…
Jun 20, 2025
Ryan Lee
XPath Agent: Automating Web Scraping with LLM‑Built XPaths
extraction
web-extraction
llm
Writing durable XPath for web scraping is time-consuming and brittle. A good XPath must work across multiple page variants, not just one.
XPath Agent
(Yu Li, Bryce Wang…
Jun 19, 2025
Ryan Lee
No matching items