Please use this identifier to cite or link to this item: http://dspace.azjhpc.org/xmlui/handle/123456789/67
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSuleymanzade, Suleyman-
dc.date.accessioned2023-04-28T22:42:48Z-
dc.date.available2023-04-28T22:42:48Z-
dc.date.issued2021-12-
dc.identifier.issn2616-6127-
dc.identifier.issn2617-4383-
dc.identifier.otherhttps://doi.org/10.32010/26166127.2021.4.2.263.266-
dc.identifier.urihttp://localhost:8080/xmlui/handle/123456789/67-
dc.description.abstractThis article presented a survey of two well-known algorithms, TF-IDF and BM-25 methods, for document ranking on a single CPU and parallel processes via HPC. An amazon review dataset with more than two million reviews was measured to measure the rank parameters. We set up the number of workers for the parallel processing during the experiment, which we selected as one and three. Four benchmarks evaluated the preprocess and reading time, vectorization time, TF-IDF transformation time, and overall time. Results metrics have shown a significant difference in speed.en_US
dc.language.isoenen_US
dc.publisherAzerbaijan Journal of High Performance Computingen_US
dc.subjectTF-IDFen_US
dc.subjectBM-25en_US
dc.subjectApache sparken_US
dc.subjectInformation retrievalen_US
dc.subjectHPCen_US
dc.titleA SURVEY OF RETRIEVAL ALGORITHMS AND THEIR PARALLELIZATION IN LARGE-SCALE SYSTEMSen_US
dc.typeArticleen_US
dc.source.journaltitleAzerbaijan Journal of High Performance Computingen_US
dc.source.volume4en_US
dc.source.issue2en_US
dc.source.beginpage263en_US
dc.source.endpage266en_US
dc.source.numberofpages4en_US
Appears in Collections:Azerbaijan Journal of High Performance Computing

Files in This Item:
File Description SizeFormat 
doi.org.10.32010.26166127.2021.4.2.263.266.pdf387.38 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.