Logo
Explore Help
Sign In
iNocturnis/webcrawler
1
0
Fork 0
You've already forked webcrawler
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
34 Commits 3 Branches 0 Tags
8d5a669d9e5b813dbd612468ccb51500b2475609
Commit Graph

7 Commits

Author SHA1 Message Date
Hieuhuy Pham
8d5a669d9e Added some trap detection for really bad links 2022-04-25 15:54:57 -07:00
Hieuhuy Pham
c1b7a50460 Locks are not racing anymore, locks work multi-thread works, change some storing information stuff so its more readble, add some new regex but it will need to be trim later because it does not do its job 2022-04-23 18:49:24 -07:00
Hieuhuy Pham
74063e5d00 Fixed a lot of racing issues, there potentially could be a writer reader confusion type of thing, but it should not matter that much, as long as server is healthy we can let this bad boi lose 2022-04-23 02:13:12 -07:00
Hieuhuy Pham
90a5d16456 Load balancer installed, havent not been able to test yet 2022-04-22 16:51:32 -07:00
Hieuhuy Pham
320fe26c23 Added basic multi-threading, reader-first implementation 2022-04-21 19:44:30 -07:00
Hieuhuy Pham
58d15918d5 Change more syntax to get data collection working, check extracturl and sorted links into sets instead of lists to signifcantly reduce url extractions 2022-04-20 04:03:58 -07:00
Hieuhuy Pham
d0dde4a4db Fixes error in syntax for new merged code from data collection branch, fixed 'infinite loop', added timers to measure performance of functions. 2022-04-20 03:52:14 -07:00
Powered by Gitea Version: 1.24.6 Page: 24ms Template: 2ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API