Browsing: automated data extraction from PDF