Commit Graph

123 Commits

Author SHA1 Message Date
HugoPoi
4f467abf1e fix(scrape-manager): keywords propagated through a clone config for not being re-affected 2020-01-17 15:12:00 +01:00
HugoPoi
89dc5c3ebb fix(scrape-manager): conflict between proxies and user_agent option 2020-01-17 12:07:12 +01:00
HugoPoi
4b33ef9b19 fix(duckduckgo): extract correct amount of results, handle pagination 2020-01-15 16:35:16 +01:00
HugoPoi
28332528ea test(duckduckgo): implement tests for duckduckgo module 2020-01-15 16:33:30 +01:00
HugoPoi
b685fb4def test: working test for html_output 2020-01-10 09:51:54 +01:00
HugoPoi
394b567db6 test: add user_agent tests, add html_output tests 2020-01-10 09:35:24 +01:00
HugoPoi
cac6b87e92 test: Bing tests working, refactor proxy for tests 2020-01-08 14:40:28 +01:00
HugoPoi
1c1db88545 test: add config proxy options tests 2020-01-07 16:50:09 +01:00
HugoPoi
8f6317cea7 style: add debug trace on some file 2020-01-07 16:47:09 +01:00
HugoPoi
f192e4ebb4 test: remove legacy tests 2020-01-07 16:43:17 +01:00
HugoPoi
3ab8e46126 test: add bing module test 2020-01-07 09:48:46 +01:00
HugoPoi
392c43390e test(google): add real integration/unit tests for google module 2020-01-03 19:21:34 +01:00
HugoPoi
8f40057534 refactor(cluster): use custom concurrency for puppeteer-cluster 2019-12-20 19:44:59 +01:00
HugoPoi
301695cd2b fix(scrape-manager): proxy_file options can be used with proxies default value 2019-12-20 19:35:23 +01:00
Nikolai Tschacher
d362e4ae2c
Merge pull request #59 from TDenoncin/refactor/logging
Refactor logging
2019-12-20 14:42:09 +01:00
HugoPoi
bcd181111b refactor(log): remove common.js, use winston and debug 2019-12-15 17:56:22 +01:00
HugoPoi
b4a86fcc51 refactor(proxy): remove proxy option not working replace by proxies 2019-12-13 18:02:22 +01:00
Nikolai Tschacher
9e6a555663
Merge pull request #52 from kujaomega/master
Added post install script to build the puppeteer-cluster, and also ad…
2019-12-01 22:15:39 +01:00
David Solé
ca9f5f7f50 Added post install script to build the puppeteer-cluster, and also added the updated dependencies from puppeteer-cluster 2019-11-22 00:37:29 +01:00
Nikolai Tschacher
1694ee92d0 updated to puppeteeer 2.0 2019-11-08 16:21:16 +01:00
Nikolai Tschacher
da69913272 added detected status to metadata 2019-10-06 15:34:18 +02:00
Nikolai Tschacher
4a3a0e6fd4 better pluggable api 2019-10-05 19:39:33 +02:00
Nikolai Tschacher
4953d9da7a chaned version 2019-09-23 23:39:06 +02:00
Nikolai Tschacher
5e47c27c70 too late to find a proper commit description 2019-09-23 23:38:38 +02:00
Nikolai Tschacher
95a5ee56d8 remove cheerio from parsing 2019-09-23 21:57:13 +02:00
Nikolai Tschacher
52a2ec7b33 changed README 2019-09-23 16:50:57 +02:00
Nikolai Tschacher
07f3dceba1 fixed google SERP title, better docker support 2019-09-23 16:46:22 +02:00
Nikolai Tschacher
b25f7a4285 added test to my working tree 2019-09-13 18:28:19 +02:00
Nikolai Tschacher
4b581bd03f removed static tests because they are too larege 2019-09-13 18:21:17 +02:00
Nikolai Tschacher
21378dab02 removed some search engines, added tests for existing, added yandex search engines 2019-09-13 16:15:33 +02:00
Nikolai Tschacher
77d6c4f04a removed some stuff 2019-09-12 10:43:57 +02:00
Nikolai Tschacher
b513bb0f5b Merge branch 'master' of github.com:NikolaiT/se-scraper
server in dockerfile was changed
2019-09-04 12:28:05 +02:00
Nikolai Tschacher
855a874f9e some minor changes 2019-09-04 12:27:53 +02:00
Nikolai Tschacher
dde1711d9d
Merge pull request #45 from slotix/master
add process supervisor for starting server.js
2019-08-29 20:41:42 +02:00
slotix
7ba7ee9226 add process supervisor for starting server.js 2019-08-19 14:01:37 +02:00
Nikolai Tschacher
e661241f6f added some parsing to google 2019-08-16 20:10:40 +02:00
Nikolai Tschacher
98414259fe docker support added 2019-08-13 17:35:06 +02:00
Nikolai Tschacher
19a172c654 better tests 2019-08-13 15:28:30 +02:00
Nikolai Tschacher
0f7e89c272 added little bug in cleaning 2019-08-12 17:16:37 +02:00
Nikolai Tschacher
ca941cee45 added static bing test, added html cleaning when exporting html 2019-08-12 16:05:17 +02:00
Nikolai Tschacher
4c77aeba76
Merge pull request #42 from TDenoncin/error-management
Clean integration tests with mocha
2019-08-12 00:04:40 +02:00
Nikolai Tschacher
0427d9f915
Merge branch 'master' into error-management 2019-08-12 00:04:27 +02:00
Nikolai Tschacher
87fcdd35d5 readme in static tests 2019-08-12 00:01:02 +02:00
Nikolai Tschacher
4ca50ab2b9 added new static test case that runs much faster and tests a lot of behavior 2019-08-11 23:58:10 +02:00
Nikolai Tschacher
8e629f6266
Merge pull request #41 from victor9000/master
Fix broken Google News selectors, fixes #40
2019-08-08 21:57:14 +02:00
HugoPoi
a369bd07f9 Add "use strict" to ensure quality code control 2019-08-06 12:18:51 +02:00
HugoPoi
dde2b14fc0 Remove uneeded try catch block in Google Search module 2019-08-06 11:50:08 +02:00
HugoPoi
0db6e068da Remove uneeded try catch block
Add proper error for ip matching test
2019-08-06 11:46:53 +02:00
HugoPoi
50bda275a6 Clean integration tests for mocha 2019-08-05 17:01:48 +02:00
Victor
a61fade2c9 Fix broken Google News selectors, fixes #40 2019-08-04 14:43:02 -07:00