se-scraper/TODO.txt
2019-01-27 15:54:56 +01:00

24.12.2018
- fix interface to scrape() [DONE]
- add to GitHub
24.1.2019
- fix issue #3: add support for loading keywords from a keyword file
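Rough sketch for the keyword file, assuming a plain-text file with one keyword per line (the file format is an assumption; issue #3 may ask for something else). parseKeywords and loadKeywordFile are hypothetical names, not existing se-scraper functions.

```javascript
const fs = require('fs');

// Hypothetical helper: turn file contents into a keyword list.
// Assumes one keyword per line; blank lines and surrounding whitespace are dropped.
function parseKeywords(text) {
  return text
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0);
}

// Hypothetical helper: read a keyword file from disk (UTF-8 assumed).
function loadKeywordFile(path) {
  return parseKeywords(fs.readFileSync(path, 'utf8'));
}
```

The parsing is kept separate from the file read so it can be checked without touching the disk.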
27.1.2019
- Add functionality to block images and CSS from loading as described here:
https://www.scrapehero.com/how-to-increase-web-scraping-speed-using-puppeteer/
https://www.scrapehero.com/how-to-build-a-web-scraper-using-puppeteer-and-node-js/
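Rough sketch of the blocking idea, using Puppeteer's request interception (page.setRequestInterception, request.resourceType, request.abort, request.continue). The helper names and the set of blocked types are assumptions for illustration, not code from se-scraper.

```javascript
// Resource types to skip. 'image' and 'stylesheet' are values reported by
// Puppeteer's request.resourceType(); 'font' could be added the same way.
const BLOCKED_RESOURCE_TYPES = new Set(['image', 'stylesheet']);

// Hypothetical helper: abort blocked resource types, let everything else through.
function handleRequest(request) {
  if (BLOCKED_RESOURCE_TYPES.has(request.resourceType())) {
    return request.abort();
  }
  return request.continue();
}

// Hypothetical helper: turn on interception for one Puppeteer page.
// Interception must be enabled before abort()/continue() may be called.
async function blockImagesAndCss(page) {
  await page.setRequestInterception(true);
  page.on('request', handleRequest);
}
```

blockImagesAndCss(page) would be called once per page, before page.goto().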
TODO:
- add proxy support
- add captcha service solving support
- check if new instances run in the same browser and if we can have one proxy per tab worker
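Rough sketch for the proxy item, assuming the proxy is passed through Chromium's --proxy-server launch flag. That flag applies to the whole browser process, so this sketch assumes one browser per proxy; whether tabs can get separate proxies is still the open question above. launchOptionsForProxy and assignProxies are hypothetical names.

```javascript
// Hypothetical helper: build options for puppeteer.launch() with an optional proxy.
// The proxy string is passed to Chromium as-is, e.g. 'socks5://127.0.0.1:9050'.
function launchOptionsForProxy(proxy) {
  const args = [];
  if (proxy) {
    args.push(`--proxy-server=${proxy}`);
  }
  return { headless: true, args };
}

// Hypothetical helper: hand out proxies to workers round-robin.
// Workers get null (no proxy) when the proxy list is empty.
function assignProxies(workerCount, proxies) {
  const assigned = [];
  for (let i = 0; i < workerCount; i++) {
    assigned.push(proxies.length > 0 ? proxies[i % proxies.length] : null);
  }
  return assigned;
}

// Intended use (needs puppeteer installed, so not executed here):
//   const browser = await puppeteer.launch(launchOptionsForProxy(proxy));
```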