This website requires JavaScript.
5fd5319ffb
Everything done and ready to test
main
inocturnis
2022-05-27 23:00:45 -0700
3e047aec45
test and readme txt
search
Aaron
2022-05-27 21:37:38 -0700
63c9bbee6f
Added functionality of creating the index through the html
inocturnis
2022-05-27 17:39:34 -0700
107d1b2a46
Basic web-gui
inocturnis
2022-05-27 17:01:35 -0700
e325b9d810
Same as previous push
unknown
2022-05-27 13:12:15 -0700
0b127af5a9
Removal of more crap related to the old weight scheme
inocturnis
2022-05-27 10:41:10 -0700
ebc72baa7d
Changed to real data to run again
inocturnis
2022-05-27 10:39:57 -0700
8d2dcea172
We are looking for TF_WEIGHT not IDF_WEIGHT, make things A LOT CHEAPER
inocturnis
2022-05-27 10:39:13 -0700
cf81cf5c6a
changing the way to calculate weight
inocturnis
2022-05-27 09:53:25 -0700
fb88efd510
Implemented all necessary indexer informations
inocturnis
2022-05-27 06:29:48 -0700
c43d6aa0a9
Fully changed indexer and worker classes with properly indexing
inocturnis
2022-05-27 05:11:01 -0700
53c7b49806
Massive changes to indexer and created merge
inocturnis
2022-05-27 03:08:56 -0700
60f6eb0df0
search functionality to obtain set of documents
unknown
2022-05-26 23:34:29 -0700
95ba16cf2e
added normalizing functionality + tfidf
unknown
2022-05-26 01:05:26 -0700
d80a977450
Added way to save doc score
unknown
2022-05-25 19:59:31 -0700
a567424a54
created new tf-idf and changed posting class
unknown
2022-05-25 18:41:36 -0700
a736e05d00
changed tf-idf
unknown
2022-05-25 18:39:02 -0700
d9fdee7b87
Added way to save ngrams to index
unknown
2022-05-13 16:42:33 -0700
a2869afcbe
Merge e7c4170cc2
into c4b3512df7
Aaron
2022-05-12 17:58:40 -0700
e7c4170cc2
Update indexer.py
Lacerum-patch-1
Aaron
2022-05-12 17:58:31 -0700
c4b3512df7
Changed tf_idf model into the new one, try it on the current dataset
inocturnis
2022-05-12 15:00:09 -0700
c8640001c7
Merge branch 'tf_idf'
iNocturnis
2022-05-12 14:30:22 -0700
808ed56bb7
Nothing changed just added a space
unknown
2022-05-11 17:22:01 -0700
f5610eaa62
tf-idf ngrams and now returns dict rather than score
tf_idf
Lacerum
2022-05-11 14:46:32 -0700
f1fe3b26ac
Merged with weighting but cannot implement due to tokens being messy and some comparison error
inocturnis
2022-05-06 20:45:52 -0700
5c703b6471
Merge remote-tracking branch 'origin/posting'
iNocturnis
2022-05-06 20:26:03 -0700
c892bbac03
Changed counter for tf to one doing O(n) instead of O(n^2), included multi-threading to speed up processing speed
inocturnis
2022-05-06 20:22:52 -0700
efb2c4e2a8
added important tokens
posting
unknown
2022-05-06 17:19:37 -0700
c616b37432
added important tokens
unknown
2022-05-06 17:18:34 -0700
8e7013e840
Merge branch 'main' into tf_idf
iNocturnis
2022-05-06 14:58:48 -0700
c05b4c7b09
Changed some files and tf_idf, added data storage, and finish the loop for indexing
inocturnis
2022-05-06 14:58:03 -0700
b82516ec85
attempted fix for if-idf
Lacerum
2022-05-06 14:03:49 -0700
b833afbfa3
filled out get_tf_idf, added test file for it
Lacerum
2022-05-06 04:04:04 -0700
81da17de93
Stemmed done
inocturnis
2022-05-04 15:30:01 -0700
0cb72cbed4
Added shelves into ignore
inocturnis
2022-05-04 13:27:56 -0700
d3a4da88ea
Removed shelves
inocturnis
2022-05-04 13:27:16 -0700
fbb1a1ab2c
Implemented a starting point for the project, run indexer.py, it will stop after 1 single file, a very rudimentary tokenzier implemented.
inocturnis
2022-05-04 13:26:18 -0700
1fb8fef7a3
First pushed, setup all the stuff we need, no launcher yet. So test your code in another place for now, because they are all codepended on each others ...
Hieuhuy Pham
2022-05-04 12:22:20 -0700
5875ac0e79
Update README.md
iNocturnis
2022-05-03 21:32:33 -0700
3a4359c53c
Update README.md
iNocturnis
2022-05-03 21:00:01 -0700
884ae4a22f
Update README.md
iNocturnis
2022-05-03 20:07:53 -0700
6eca34ba71
Initial commit
iNocturnis
2022-05-03 20:07:26 -0700