AI-Powered Search Engine for Research Papers (Project for NLP)

Overview

This project aims to build an intelligent search engine that understands the semantic meaning of research queries and ranks documents based on abstract importance and key concept extraction.

Concepts Used

Sentence Transformers – Understanding query semantics
LSTM, RNN – Document ranking based on abstract importance
CBOW Embeddings – Extracting key concepts from papers
Named Entity Recognition (NER) – Extracting citations, authors, and key entities
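
As a rough illustration of the semantic ranking idea, the sketch below orders papers by cosine similarity between a query vector and each paper's vector. The vectors are tiny hand-made placeholders; in the project they would presumably come from a Sentence Transformers model. The function names cosine_similarity and rank_papers are hypothetical and not taken from the repository.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat zero vectors as unrelated to everything
    return dot / (norm_a * norm_b)


def rank_papers(query_vec, paper_vecs):
    """Return (title, score) pairs sorted from most to least similar."""
    scored = [(title, cosine_similarity(query_vec, vec))
              for title, vec in paper_vecs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Placeholder 3-dimensional "embeddings" (assumption: real ones come from a model).
papers = {
    "Attention mechanisms in translation": [0.9, 0.1, 0.0],
    "Protein folding with graph networks": [0.1, 0.9, 0.2],
    "Transformers for summarization": [0.8, 0.2, 0.1],
}
query = [1.0, 0.0, 0.0]

ranking = rank_papers(query, papers)
for title, score in ranking:
    print(f"{score:.3f}  {title}")
```

Cosine similarity ignores vector length, so a long and a short abstract pointing in the same direction score the same.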

Datasets

Features

✅ Semantic Search: Uses Sentence Transformers to improve query understanding
✅ Intelligent Ranking: LSTM-based ranking of research papers
✅ Concept Extraction: CBOW embeddings identify key topics
✅ Entity Recognition: NER extracts authors, citations, and key entities
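
CBOW (continuous bag of words) learns embeddings by predicting a word from the words around it. The sketch below shows only how the (context, target) training pairs could be formed from a tokenized sentence; the model training itself is not shown. The function name cbow_pairs and the window size are assumptions made for illustration, not code from the repository.

```python
def cbow_pairs(tokens, window=2):
    """Yield (context_words, target_word) pairs for CBOW training."""
    for i, target in enumerate(tokens):
        left = tokens[max(0, i - window):i]
        right = tokens[i + 1:i + 1 + window]
        context = left + right
        if context:  # skip single-token inputs with no context
            yield context, target


tokens = "graph neural networks learn node representations".split()
pairs = list(cbow_pairs(tokens, window=1))
for context, target in pairs:
    print(context, "->", target)
```

A larger window tends to capture topical similarity, while a smaller one tends to capture words that are used in the same syntactic position.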

🛠 Tech Stack

Language Models: BERT, Sentence Transformers
Deep Learning: LSTM, RNN, PyTorch/TensorFlow
NLP Techniques: Named Entity Recognition (NER), Word Embeddings (CBOW)
Database: PostgreSQL / MongoDB for storing research papers
Backend: FastAPI / Flask
Frontend: Next.js, TypeScript, Tailwind CSS

General Instructions before installing locally

Make sure Node.js and Python are installed before following the next steps.
To check, run these commands in a command prompt:
```
node -v
```
```
python --version
```
If anything about the folder structure is unclear, feel free to ask.
arxiv_metadata.json is listed in .gitignore because it is about 4 GB. Download it separately and place it in the search_engine/data/raw folder.
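
Because the file is several gigabytes, it is usually safer to stream it than to load it all at once. The sketch below assumes the file holds one JSON object per line and that each record has id and abstract fields; both are assumptions about the download, so check them against your copy. The function names iter_papers and first_abstracts are hypothetical.

```python
import json


def iter_papers(path):
    """Yield one paper record (a dict) per non-empty line of the file."""
    with open(path, "r", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def first_abstracts(path, limit=3):
    """Collect (id, abstract) for the first `limit` records that have both."""
    results = []
    for record in iter_papers(path):
        # Field names are assumed; adjust them to match the actual file.
        if "id" in record and "abstract" in record:
            results.append((record["id"], record["abstract"].strip()))
        if len(results) >= limit:
            break
    return results
```

For example, first_abstracts("search_engine/data/raw/arxiv_metadata.json") would return the first three (id, abstract) pairs while reading only as many lines as it needs.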

Installation

Clone the repository:

```
git clone https://github.com/Augnik03/ResearchAI.git
cd ResearchAI
```

For working on the frontend:
```
cd frontend
npm i
```

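If the install fails with a dependency error, retry with one of these commands (try --legacy-peer-deps first, and --force only if that also fails):
```
npm i --legacy-peer-deps
npm i --force
```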

To run development server:
```
npm run dev
```

For working on the backend:
```
cd search_engine
```

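Before installing the Python dependencies, create and activate a virtual environment. The activation command below is for the Windows command prompt; on macOS or Linux, run source venv/bin/activate instead:
```
python -m venv venv
venv\Scripts\activate
```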

Install dependencies:
```
pip install -r requirements.txt
```
To run the preprocessing script on the dataset:
```
python preprocess.py
```

Contributing

Contributions are welcome! Fork the repo, create a feature branch, and submit a PR.
