Blog Tagger

This Package can be used to extract keywords from a page to create tags for any blogs,news or any textual information available on the web page. These tags highlights the topic content by providing a glance of large volume of texts embedded in a page.Tag generation is an important feature in many sectors of IT such as Amazon uses tags for customer segmentation.

Prerequisites:

Install packages in the requirements.txt using pip install -r requirements.txt
Download the spaCy English model after installation:
python -m spacy download en_core_web_sm
Follow the instruction given below to use albert-base model from hugging face model hub, you can change the model but it might need some customization in source code. so albert model is adviced here to download.

model=TFAutoModel.from_pretrained('albert-base-v2')
tokenizer=AutoTokenizer.from_pretrained('albert-base-v2')

Usage Instructions

Clone the repository on local system
Collect web data

For example

from web_data import Blog_Data
data=Blog_Data("https://influencermarketinghub.com/12-best-food-blogs/") pass website
Text_data=data.text_prep(req=['h1', 'h2', 'h3', 'h4', 'p']) pass tags

Use main class Blog tagger to generate top k tags.

For example

tagger=Blog_Tagger(Text_data,maxlen=<int num>)
tagger.token_embedding_gen(model,tokenizer)
top_tokens=tagger.tag_gen(k)

Source Repository that contains package

Link : original repository

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
__init__.py		__init__.py
blog_keyword.py		blog_keyword.py
requirements.txt		requirements.txt
web_data.py		web_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Blog Tagger

Prerequisites:

Usage Instructions

Source Repository that contains package

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Blog Tagger

Prerequisites:

Usage Instructions

Source Repository that contains package

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages