Commit Graph

  • 5fd5319ffb Everything done and ready to test main inocturnis 2022-05-27 23:00:45 -0700
  • 3e047aec45
    test and readme txt search Aaron 2022-05-27 21:37:38 -0700
  • 63c9bbee6f Added functionality of creating the index through the html inocturnis 2022-05-27 17:39:34 -0700
  • 107d1b2a46 Basic web-gui inocturnis 2022-05-27 17:01:35 -0700
  • e325b9d810 Same as previous push unknown 2022-05-27 13:12:15 -0700
  • 0b127af5a9 Removal of more crap related to the old weight scheme inocturnis 2022-05-27 10:41:10 -0700
  • ebc72baa7d Changed to real data to run again inocturnis 2022-05-27 10:39:57 -0700
  • 8d2dcea172 We are looking for TF_WEIGHT not IDF_WEIGHT, make things A LOT CHEAPER inocturnis 2022-05-27 10:39:13 -0700
  • cf81cf5c6a changing the way to calculate weight inocturnis 2022-05-27 09:53:25 -0700
  • fb88efd510 Implemented all necessary indexer informations inocturnis 2022-05-27 06:29:48 -0700
  • c43d6aa0a9 Fully changed indexer and worker classes with properly indexing inocturnis 2022-05-27 05:11:01 -0700
  • 53c7b49806 Massive changes to indexer and created merge inocturnis 2022-05-27 03:08:56 -0700
  • 60f6eb0df0 search functionality to obtain set of documents unknown 2022-05-26 23:34:29 -0700
  • 95ba16cf2e added normalizing functionality + tfidf unknown 2022-05-26 01:05:26 -0700
  • d80a977450 Added way to save doc score unknown 2022-05-25 19:59:31 -0700
  • a567424a54 created new tf-idf and changed posting class unknown 2022-05-25 18:41:36 -0700
  • a736e05d00 changed tf-idf unknown 2022-05-25 18:39:02 -0700
  • d9fdee7b87 Added way to save ngrams to index unknown 2022-05-13 16:42:33 -0700
  • a2869afcbe
    Merge e7c4170cc2 into c4b3512df7 Aaron 2022-05-12 17:58:40 -0700
  • e7c4170cc2
    Update indexer.py Lacerum-patch-1 Aaron 2022-05-12 17:58:31 -0700
  • c4b3512df7 Changed tf_idf model into the new one, try it on the current dataset inocturnis 2022-05-12 15:00:09 -0700
  • c8640001c7 Merge branch 'tf_idf' iNocturnis 2022-05-12 14:30:22 -0700
  • 808ed56bb7 Nothing changed just added a space unknown 2022-05-11 17:22:01 -0700
  • f5610eaa62 tf-idf ngrams and now returns dict rather than score tf_idf Lacerum 2022-05-11 14:46:32 -0700
  • f1fe3b26ac Merged with weighting but cannot implement due to tokens being messy and some comparison error inocturnis 2022-05-06 20:45:52 -0700
  • 5c703b6471 Merge remote-tracking branch 'origin/posting' iNocturnis 2022-05-06 20:26:03 -0700
  • c892bbac03 Changed counter for tf to one doing O(n) instead of O(n^2), included multi-threading to speed up processing speed inocturnis 2022-05-06 20:22:52 -0700
  • efb2c4e2a8 added important tokens posting unknown 2022-05-06 17:19:37 -0700
  • c616b37432 added important tokens unknown 2022-05-06 17:18:34 -0700
  • 8e7013e840 Merge branch 'main' into tf_idf iNocturnis 2022-05-06 14:58:48 -0700
  • c05b4c7b09 Changed some files and tf_idf, added data storage, and finish the loop for indexing inocturnis 2022-05-06 14:58:03 -0700
  • b82516ec85 attempted fix for if-idf Lacerum 2022-05-06 14:03:49 -0700
  • b833afbfa3 filled out get_tf_idf, added test file for it Lacerum 2022-05-06 04:04:04 -0700
  • 81da17de93 Stemmed done inocturnis 2022-05-04 15:30:01 -0700
  • 0cb72cbed4 Added shelves into ignore inocturnis 2022-05-04 13:27:56 -0700
  • d3a4da88ea Removed shelves inocturnis 2022-05-04 13:27:16 -0700
  • fbb1a1ab2c Implemented a starting point for the project, run indexer.py, it will stop after 1 single file, a very rudimentary tokenzier implemented. inocturnis 2022-05-04 13:26:18 -0700
  • 1fb8fef7a3 First pushed, setup all the stuff we need, no launcher yet. So test your code in another place for now, because they are all codepended on each others ... Hieuhuy Pham 2022-05-04 12:22:20 -0700
  • 5875ac0e79
    Update README.md iNocturnis 2022-05-03 21:32:33 -0700
  • 3a4359c53c
    Update README.md iNocturnis 2022-05-03 21:00:01 -0700
  • 884ae4a22f
    Update README.md iNocturnis 2022-05-03 20:07:53 -0700
  • 6eca34ba71
    Initial commit iNocturnis 2022-05-03 20:07:26 -0700