Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You're right that all I have to do is find the first line of the footnotes, because everything above is the text and everything under it is footnotes.

For now I have selected a crude approach: there is a gap in the page between the text and the notes, of about one line height. So if one simply takes all the first words of each line and compares their vertical distance, when that distance grows significantly, it's where the footnotes start.

I have tested this method on a dozen of pages and it works, but it remains to be seen if it will stand the test of many pages, esp. those that are askew.

Using the average width of letters instead of their height is a neat idea though; visually it's undeniable that there is a greater difference of width than of height between the footnotes and the main text. I may resort to that if the crude approach proves too simple!



Good luck




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: