Bibliographic & Citation Databases:
Scholarly Information Sources for Academic Researcher’s
Shivaram BS, PhD
Joint Head, ICAST
CSIR-National Aerospace Laboratories, Bangalore
shiavram@nal.res.in
Academic Researcher’s Life
Personal Life
•Classes
Management •Examination duties
•Branding •Seminar/Conferences
•Ranking
•Accreditation •Committees
•ROI •Meetings
•Admissions
•Institution activities
•Societal Role
Academic Researcher’s Clock
Academic 6
Research Literature
Writing, 20% Search, 20%
Literature
Review, 10%
Research Data Analysis,
20%
4 Research
Design, 10%
Data Collection,
20%
Personal
14
Publication Metrics
Research & Publishing Publication
Misconduct
Research Writing
Reference
Management
Literature Search
Validation
Experimentation/Mo
deling / Simulations
Problem Definition
Literature Search
Avenues for Scholarly Information/Literature (Bib sources)
Past Present
Information Retrieval on Internet
Digital Content
Tools for Web Information Retrieval
• Search Engines (Generic Information)
– Meta Search Engines
– Specialty Search Engines
– Web Directories
– Portals & Gateways
• Scholarly Databases
– Bibliographic databases
– Citation Databases
– Patent Databases
– Digital Library Platforms
– Open Access Literature Platforms
Search Engines (A layman approach)
Search Engine: is a program that searches for
keywords specified by the user, in the databases
of websites on the World Wide Web About 8,41,00,000 results
Top Search engines
Google (95% uses) https://www.google.com/
Bing http://www.bing.com/
Ask http://www.ask.com/
Yahoo https://www.yahoo.com/
Lycos http://www.lycos.com/ About 71,500,000 results
DuckduckGo (Indian, No tracking) https://duckduckgo.com/
Yandex (Translator) https://www.yandex.com/
Entireweb http://www.entireweb.com/
Gigablast (Interesting features) http://www.gigablast.com/
Meta Search Engines (A layman approach)
Meta search Engines: take input from a
user and simultaneously send out queries
to third party search engines for results
Top Meta Search engines
WebCrawler http://www.webcrawler.com/
Dogpile http://www.dogpile.com/
Info.com http://www.info.com/
Startpage https://www.startpage.com/
eXicte http://www.excite.com/
zoo http://www.zoo.com/
Search.com http://www.search.com/
Yippy http://www.yippy.com/
Mamma https://mamma.com/
Infospace http://infospace.com/
Specialty Search Engines (A layman approach)
Specialty search Engines: take input from a user and
simultaneously send out queries to public search engines for
results & organizes search results into clusters; offers better
visualizations
Top Specialty Search engines
Carrot2 https://search.carrot2.org/#/search/web
https://millie.northernlight.com/dashboardfolder.php
Millie
Search strategy in Google
Phrase Search:
General To narrow
Search down
: 95% Noise
Google Advanced
Restricting Search Options
by Filetype: pdf
I am an Academician/Researcher
Interested in scholarly literature!!! How
to get????
Literature (Information) Band
Retrievals by General Search Engines
Books
Adams R J, Smart P and Huff A S, Shades of grey: guidelines for working with the grey literature in
systematic reviews for management and organizational studies, International Journal of Management
Reviews, 19(4) (2017) 432–454
Scholarly Information
• Information created in the course of research activities
• Information published by scholars to inform their learning
/ research findings
• Information which is undergone a rigorous review process
by peers in their discipline
• Published in regular publishing framework – Commercial,
societies, Open access, so on
Scholarly Information: Document types
• Journal articles
– Review articles
– Original Research articles
– Case study
– Rapid communications
• Conference papers
• Books / Book chapters
• Government reports
• Case Studies reports
Scholarly Information growth
• Global scientific output doubles every nine years (Nature News Blog dated 07
May 2014 by Richard Van Noorden)
• 33100 English Language and 9400 non English Language Peer
reviewed journals adding over 3 million articles every year (STM Report
2018)
• Scholarly Literature (Source: Web of Knowledge platform)
Journal & Conference Patents E-Books Data sets
Papers
155 million 39.3 million patent families with More than one lakh 7.3 million
more then 70 million patents
Thanks to ICT, most of them are available Online
How do I Trust Web Information (Research)
Follow CRAAP Model
Scholarly Information: Access Modes
Offline Mode Online (Digital) Mode
Libraries Subscribed
STM Publishers
Content
Personal
Gold OA
Collection Open Access
Content Green OS
Archival
Centers Author Profiles /
Homepages
Free Content
Academic Social Media
Platforms
Scholarly Information Discovery Platforms
Grey
Scholarly Data Repositories literature E-Books
Search
Engines Bib.
Open Access Databases Report
Library content servers
OPACs Patent Reference
Resources Management
platforms
Digital
E-print Libraries
servers
Publisher
Aggregators platforms
Datasets
Thesis &
Dissertations
Servers Manuals
Scholarly Search Engines
• Specialty Search Engine Examples
• Google Scholar
• Academic Search – https://scholar.google.co.in/
Engines • Microsoft Academic Search
– http://academic.research.microsoft.com/
• Restricted to Scholarly • CrossRef Metadata Search
– http://www.crossref.org/
Content
• Semantic Scholar – AI Powered
• Add on functionalities – https://www.semanticscholar.org/
• Gettheresearch
• Powerful search – https://gettheresearch.org/
functionalities • BASE (Open Access articles)
– http://www.base-search.net/
Search Engine: General Vs Scholar
Google Scholar
Google Always recommend scholarly search engine
Google Scholar: Tips
Link to available Full text
Cited by: links to all articles list who has cited
Related articles: Brings you related articles
All Versions: links all available places where details of the article present
Cite: Exports Citation of the article (MLA, APA, Chicago, Harvard) (Bibtext, Refman, Endnote, RefWorks)
Save: will save to your Google scholar library
Demonstration
CrossRef
• Not-for-profit membership organization for scholarly publishing
to make content easy to find, cite, link, and assess
BASE
• 100 Million documents from 5000 sources, 60% is open access content
• Contain Metadata of academically relevant resources - journals, institutional
repositories, digital collections etc
• Indexed only document servers which matches the quality criteria of BASE
• Discloses web resources of the "Deep Web“ which commercial search engine
fails
• Excellent Refining filters (browse by Library Classification Number)
• BASE is an OAI Service provider, it can be integrated to local collection –
Federated search, Discovery
BASE: https://www.base-search.net/
BASE Search Results
Semantic Scholar
• 20+ Million digital items across all
disciplines
• Profile based functionalities
• Citation tracking
• Citation / reference export
functionalities
• Setup library (Personal collection)
• Automatic Alerts
https://www.scienceopen.com/
81 Million Articles, 25 K Journals, 3200 publishers
Advance Search, Filtering Options, OA articles, References Export,
Altmetrics
Database of bibliographic records, an organized digital collection of
references to published literature which includes journal articles,
conference proceedings, reports, patents, books, etc.
• Subject Specific
• Platform for comprehensive
literature search
• Wider Coverage
• CDs / DVDs / Web Version
• Powerful search interface
Engineering: Engineering Village (Combination of Databases)
• Provides access to 12 engineering
document databases
• Published by Elsevier (Commercial)
• 190 engineering disciplines & 73
countries
• 3,800+ journals from 1,988 publishers
• 117 trade magazines
• 80,000+ conference proceedings & 83
book series
• Link to Full text Articles
• Created by the Institution of Engineering and
Technology (IET)
• Service Provided by EBSCO (Commercial)
• Subject Coverage: physics, electrical engineering, electronics,
communications, control engineering, computing, information technology,
manufacturing, production and mechanical engineering
• Coverage: 30+ Million articles from 4500 Journals published by 500+
Publishers
• Inspec : also indexes more than 6 million conference
items, plus preprints, books, dissertations, patents,
reports and videos
• Inspec Analytics: helps to know the research trend
• Inspec Archive: Science abstracts from 1898-1968
• Published CAS a division of American Chemical
Society (ACS)
• Access to the world’s most reliable and
comprehensive chemical and scientific information –
Rigorous quality check
• Powerful Smartsearch technology
– Substance Search
– Structure Search
– Chemical Properties & reaction Search
• Technology Trends
kind of bibliographic database, an index of citations between publications, allowing the user to
establish which later documents cite which earlier documents. Can generate citation profiles for
authors, organizations….
Citation Databases
Dimension
CiteseerX Citation
Data
Crossref
Web of Google
SCOPUS
Knowledge Scholar
Clarivate
Elsevier Google
Analytics
Web of Knowledge (WoS)
• Oldest Citation Database – covers 115 years of the
highest-quality research data
• Publisher-neutral : A robust evaluation and Curation
process by a team of experts
– 28 Quality Criteria for Journals
» 24 Editorial Criteria
» 4 Impact Criteria
• Discipline wise (24+ K )
– Science Citation Index Expanded (SCIE) – Web of Science -
– Social Sciences Citation Index (SSCI)
– Arts & Humanities Citation Index (AHCI)
– Book Citation Index (BKCI)
– Conference Proceedings Citation Index (CPCI)
• 1.18 billion cited references from over 53 million
records – WoS
• Source for Journal Impact Factor
Web of Knowledge (WoS)
Independent Editorial Team
Web of Knowledge (WoS)
Engineering Electrical Electronic
736192 Materials Science Multidisciplinary
755453
3098341
763593 Physics Applied
771666 Biochemistry Molecular Biology
863471 Chemistry Multidisciplinary
Chemistry Physical
873447 2150652
Oncology
Neurosciences
885307
Environmental Sciences
Pharmacology Pharmacy
887540 1740709
Surgery
Computer Science Theory Methods
894217
Optics
Computer Science Artificial Intelligence
895966 1730392
Clinical Neurology
Multidisciplinary Sciences
975778
Telecommunications
1376138 Computer Science Information Systems
1004546
Cell Biology
Physics Condensed Matter
1016656 1304532
Medicine General Internal
1039893 1250651
Immunology
1070656 Cardiac Cardiovascular Systems
1144692
1092584
Public Environmental Occupational Health
SCOPUS
• Launched in 2004 by Elsevier
• Citation database of peer-reviewed literature
• Helps to track, analyze and visualize research
• Content sources : Journals, Books, Conference
Proceedings
– 39,743 Serial titles
• Over 25000 active
• 14,558 – Inactive
– 210000+ book titles
– 5000+ Publishers
• Integrated with ORCID
• SciVal – Advanced analytics solution for
Research Evaluation
SCOPUS: Content Selection Process
• Content Recommended by Content Selection and Advisory Board (CSAB)
• Review new titles using both quantitative and qualitative measures
Journal Policy Convincing editorial policy; Type of peer review; Diversity in geographical
distribution of editors ; Diversity in geographical distribution of authors
Content Academic contribution to the field; Clarity of abstracts; Quality of and
conformity to the stated aims and scope of the journal; Readability of
articles
Journal Standing Citedness of journal articles in Scopus; Editor standing
Publishing Regularity No delays or interruptions in the publication schedule
Online Availability Full journal content available online; English language journal home page
available; Quality of journal home page
Metric Self-citation rate, Total citation rate; CiteScore
Publication concerns Publication Ethics
SCOPUS: Subject & Publisher Coverage
Subject wise
Publisher wise
SCOPUS
• Search by document, author or affiliation, or use Advanced
Search
• Track citations over time for a set of authors or Institutions or
documents using Citation Overview
• Assess research trends with Analyze Module
• View h-index for specific authors / institution
• Analyze an author’s publishing output and research impact
with Author Evaluator
• Gain insight into journal performance with Compare Journals-
multiple metrics, including CiteScore, SNIP and SJR
Aggregators
Databases of full-text articles, defined by subject area and sold as a single product, rather than as
individual subscriptions.
• Ingentaconnect: (http://www.ingentaconnect.com/ )
• 10000 publications from 290+ publishers
• 630 Engineering titles
• ProQuest: http://www.proquest.com
• 9000 publishers
• Project MUSE: http://muse.jhu.edu/
• 240 Publishers in Humanities and social sciences
• JSTOR: www.jstor.org
• 214 titles from 48 publishers + Ebooks
• Highwire Press: http://home.highwire.org/ Open Access Article
• 3000 scholarly journals and thousands of scholarly books
Publisher Platforms
• Sciencedirect
• Springerlink
• Wiley
• Emerald
• IEEE Digital Library
• ASME/ACS/ Subscribed Content
Many more!!! Encourage users to create Profile
How do I Find books published in my field ??
Library OPACs – Free to access
• Library of Congress
– 17 million book titles
(https://catalog.loc.gov/ )
• Indcat – Inflibnet
– 8.19 Million books from 176 Indian
universities (Indian books)
(https://indcat.inflibnet.ac.in/)
• College OPAC
Fulltext E-Books- Digital Libraries
• Internet Archive Books
– 1 million full-text books
(https://archive.org/details/internetarchivebooks
• National Digital Library
– 3.9 Million books (World e-book library)
(https://ndl.iitkgp.ac.in/ )
• Google Books (Project Ocean)
– 30 + Million books
– Free fulltext Access to part of the collection
(https://books.google.com/)
Patent Databases
Patent Information
• Information found in patent applications and granted
patents.
• Patent information includes
– Bibliographic data
– Abstract
– Description
– Claims
– Drawings
• Patent information is publicly discloses the newly
developed technologies
• Patent information helps to develop new technical
solutions by other inventors
Prior Art Search
• All public information available prior to the date of filing of the
relevant patent application against which the patentability of the
invention will be determined.
– Journal Articles, Conference Papers, etc Information not considered in
– Report literature prior art
– Patents (Filed & Granted)
• Non-public Information
• Existing relevant technology
• Trade Secrete
• Traditional Knowledge / Oral disclosures
• Documents in internal use /
• Novelty/Non-obviousness circulation
• First to File/First to Invent
Types of Prior Art Search
Novelty Search: to find novelty / non-obvious.
Patentability Search: ascertain the chance or likelihood of an invention
getting a patent.
Infringement search make sure that nobody without your consent makes,
uses, or sells your patented invention.
Validity / Invalidity conducted after the issuance of patent to validate the
Search enforceability of a patent’s claims.
Patent Landscape To know business, scientific and technological trends in
the area / domain
Whitespace analysis To know the little or no patenting activity.
Why Novelty Search?
• Large Investment
• High cost in maintaining patents
• Helps to find out novelty of research by
comparing prior inventions
• Helps to identify White spaces
• Helps in future R & D Strategy and Decision
making
• To avoid Future litigation
Patent Databases
Free Databases Commercial Databases
•PATENTSCOPE
•Google Patents
•Lens.org •Thomson Innovations
•USPTO •Questel Orbit
•Espacenet •XLPAT
•Country Specific •IEEE Innovation Q Plus
•Japan – PAJ
•PATSNAP
•Germany- DPMA Register
•Patbase
• India - inPASS
•Freepatentonline
Not Possible Possible
White Space Analysis based on Patent Landscape Search
• White-spaces are
gaps in a technology
landscape.
• “White Space” is the
area with little or no
patenting activity.
• White-space analysis
is used as methods
for strategic product
innovation
Patent Landscaping : Trends
Patent Information (Structure)
• Each Information field is denoted by Numerical code
• First Page Information (Descriptive information)
• Patented country
• Patent Number
• Bibliographic Details
• Title
• Inventors
• Assignee
• Application Number
• Cited references
• Abstract
Patent Information (Structure)
• Drawings
– Parts named with numbers which are cross
referred in detailed description
• Field of Invention
• Background of Invention (Prior art data)
• Summary of Invention
– The objects
– Problems solved
• Detailed description of specification
• Claims
– Independent claims
– Dependent claims
VTU Consortium
• Started in 2013-14
• Unique & Successful model in providing
Access to Scholarly Content
• Mantra: Equity in Access
• Self Financed – Student Fee Model
• Multidisciplinary – Major STM Publisher
Coverage
• On Campus- IP Based Access, Off-Campus
through MAMY ACCESS platform
• It is emulated by Many Technological
Universities like- AKTU, JNTUA, KTU etc
VTU Consortium Journal Collection
VTU Consortium E-Books Collection
VTU Consortium Technology Platforms
Resources Available at VTU Campus
IEL Online Citation Databases
(Entire IEEE Platform) SCOPUS & WoS
Patent Database
Orbit Intelligence
VTU Consortium 24X7 Access
https://vtulib.mapmyaccess.com/resources
Quick Recap
• Web Information Retrieval platforms
– Search Engines, Scholarly Search Engines,
• Bibliographic Databases
– Engineering Village, INSPEC, CAS (SciFinder)
• Citation Databases
– SCOPUS, Web of Science
• Patent Databases
– Free & Paid Patent Databases
• VTU E-Resources Consortium
Analysing Title
Statement of Problem: Optimization of Hydrogen Fuel Cells for Electric Aircraft
Tertiary Keyword Primary Keyword Secondary Keyword
Identify Key Words Optimization Hydrogen Fuel Cell Electric Aircraft
Synonyms / Alternate /
Similar words
Electric Airplane
Improve Electrochemical cell
Battery powered
Advancement Hydrogen Energy
Electric Aeroplane
upgrade Battery Technology
Search Operators
Search Operator Operator Function
Boolean Operators
Simple Operators
AND Combine Keywords – Narrow down the Search (Limit results)
Batteries AND Aircraft
OR Either Keywords – Expanding the search (Retrieves more results)
Aircraft OR Airplane OR Aeroplane
NOT Excludes Keywords – Selective Exclusion (Limit Results)
Civilian Aircraft NOT Combat Aircraft
Advanced Operators
NEAR Search near by contextual or Related words
(Aircraft NEAR/4 Civilian)
() Used to group together words or phrases or Boolean operators
Ex. (Dogs AND ((rabies OR rabid) NOT (domestic OR Pet) ))
Search Operators
Search Operator Operator Function
Phrase Search
“” Combining keywords – Narrow down search
“Hydrogen Fuel Cell”
Truncated Search
Wild Characters Truncate Keyword
*? Network* includes Network, Networking, Networks, Networked
Field Based Search
Searching Restrict Search for relevant results
Metadata Fields Title, Abstract, Keywords, Journal Name etc.
Proximity Search
NEAR, WITHIN, Two or more separate Keywords occurrences are within a specified distance
PRE
Search Operators
Search Operator Operator Function
Advanced Search
Combination of all TITLE-ABS ( "FUEL CELL" ) AND ( aircraft OR aeroplane ) AND NOT ( "Combat
aircraft" )
search Operators
Search Approach
Statement of Problem: Optimization of Hydrogen Fuel Cells for Electric Aircraft
Generic Topic Specific Topic
Broader to Narrower Narrower to Broader
Electric Mobility Electric Aircraft
Fuel Cells Hydrogen Fuel Cells
SCOPUS :Demo (https://scopus.com )
Search Operations
Simple Search Field based Search Advanced based Search
• Document • Field based search
• Author
• Constructing advanced Query
• Researcher Discovery • Filters
• Affiliation
Search Operators
Boolean Operators Phrase Search Truncation
SCOPUS : Simple Search
SCOPUS : Author Search
SCOPUS : Filed Based Search
SCOPUS : Search
SCOPUS : Advanced Search
SCOPUS : Export Options
SCOPUS : Full-text Options
WoS :Demo
Search Operations
Simple Search Field based Search Advanced based Search
• Document • Field based search
• Cited Reference • Constructing advanced Query
• Structure • Filters
Search Operators
Boolean Operators Phrase Search Truncation
WoS : Simple Search
WoS : Structure Search
WoS : Author Search
WoS : Field Search
WoS : Filters Search
20
Filter
s
WoS : Advanced Search
WoS : Export References
WoS : Fulltext Options
BASE :Demo (https://www.base-search.net/ )
Search Operations
Simple Search Field based Search Advanced based Search
• Field based search
• Keyword Search
• Constructing advanced Query
• Browse • Filters
Search Operators
Boolean Operators Phrase Search Truncation
Patent Database : PATENTSCOPE
Patent Database : PATENTSCOPE
Patent Database : PATENTSCOPE
Next Class
Managing Bibliographic records / References:
Reference Management Tools