A Bibliometric-Based Semi-automatic Approach to Identification of Candidate Thesaurus Terms: Parsing and Filtering of Noun Phrases from Citation Contexts

  • Conference paper
Context: Nature, Impact, and Role (CoLIS 2005)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3507)

Abstract

The present study investigates the ability of a bibliometric-based semi-automatic method to select candidate thesaurus terms from citation contexts. The method consists of document co-citation analysis, citation context analysis, and noun phrase parsing. The investigation is carried out within the specialty area of periodontology. The results clearly demonstrate that the method is able to select important candidate thesaurus terms within the chosen specialty area.
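
As an illustration of the first step named above, document co-citation analysis, the following is a minimal sketch and not the authors' implementation. It assumes the common definition in which two documents are co-cited whenever they appear together in the reference list of a citing paper. The function name `cocitation_counts` and the sample data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cocitation_counts(reference_lists):
    """Count, for each pair of cited documents, how many citing papers cite both.

    reference_lists: iterable of lists, one list of cited-document IDs
    per citing paper (hypothetical input format).
    """
    counts = Counter()
    for refs in reference_lists:
        # Deduplicate and sort so each unordered pair has one canonical key
        # and a paper that lists a document twice is counted only once.
        for pair in combinations(sorted(set(refs)), 2):
            counts[pair] += 1
    return counts


# Hypothetical citing papers, each given as the documents it cites.
citing_papers = [
    ["A", "B", "C"],
    ["A", "B"],
    ["B", "C", "C"],
]
counts = cocitation_counts(citing_papers)
```

Pairs with high counts would be candidates for clustering. The later steps (citation context analysis and noun phrase parsing) operate on the text surrounding the citations and are not covered by this sketch.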

The present paper is sponsored by NORSLIS (Nordic Research School in Library and Information Science).

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schneider, J.W., Borlund, P. (2005). A Bibliometric-Based Semi-automatic Approach to Identification of Candidate Thesaurus Terms: Parsing and Filtering of Noun Phrases from Citation Contexts. In: Crestani, F., Ruthven, I. (eds) Context: Nature, Impact, and Role. CoLIS 2005. Lecture Notes in Computer Science, vol 3507. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11495222_18

  • DOI: https://doi.org/10.1007/11495222_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26178-0

  • Online ISBN: 978-3-540-32101-9

  • eBook Packages: Computer Science, Computer Science (R0)
