Show simple item record

dc.contributor.authorShah, Himat
dc.contributor.authorRezaei, Mohammad
dc.contributor.authorFränti, Pasi
dc.contributor.editorTavares, João Manuel RS
dc.date.accessioned2020-02-10T11:44:26Z
dc.date.available2020-02-10T11:44:26Z
dc.date.issued2019
dc.identifier.urihttps://erepo.uef.fi/handle/123456789/8021
dc.description.abstractWe present D-rank, an unsupervised, language and domain independent method for automatically extracting keywords from a single web page. The method does not use any corpus, and relies only on the information and features on the web page including page URL, word frequency, title, hyperlinks, and headers, which are extracted from DOM tree of the page. Different scores are assigned to the words according to their importance that is specified by their positions in the web page. Experimental results on web pages in three different languages show the effectiveness of the proposed method.
dc.language.isoenglanti
dc.publisherACM Press
dc.relation.ispartofAIIPCC '19: Proceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud Computing
dc.relation.urihttp://dx.doi.org/10.1145/3371425.3371495
dc.rightsIn copyright 1.0
dc.titleDOM-based keyword extraction from web pages
dc.description.versionfinal draft
dc.contributor.departmentSchool of Computing, activities
uef.solecris.id67745320en
dc.type.publicationArtikkelit ja abstraktit tieteellisissä konferenssijulkaisuissa
dc.relation.doi10.1145/3371425.3371495
dc.description.reviewstatuspeerReviewed
dc.relation.articlenumber62
dc.relation.isbn978-1-4503-7633-4
dc.rights.accesslevelopenAccess
dc.type.okmA4
uef.solecris.openaccessEi
dc.rights.copyright© Association for Computing Machinery
dc.type.displayTypearticleen
dc.type.displayTypeartikkelifi
dc.rights.urlhttps://rightsstatements.org/page/InC/1.0/


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record