Info Miner

Transféré par

SaiKrishnaReddy

0% ont trouvé ce document utile (0 vote)

72 vues3 pages

Abstract for Info miner project

Copyright

Formats disponibles

PDF, TXT ou lisez en ligne sur Scribd

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Signaler ce document

Abstract for Info miner project

Droits d'auteur :

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

0% ont trouvé ce document utile (0 vote)

72 vues3 pages

Info Miner

Transféré par

SaiKrishnaReddy

Abstract for Info miner project

Droits d'auteur :

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

Passer à la page

Vous êtes sur la page 1sur 3

Rechercher à l'intérieur du document

PROJECT PRESENTATION COMPETITION

APOGEE 2011
ABSTRACT

COLLEGE NAME:

Manipal Institute of Technolgy, Manipal

TITLE OF PROJECT:

InfoMiner

TEAM LEADER :

Syed Aqueel Haider 9008420619 aqueel.h.rizvi@gmail.com

TEAM MEMBERS

Rishabh Mehrotra 9014516301 erishabh@gmail.com
(BITS Pilani)

ABSTRACT

TITLE OF PROJECT:

InfoMiner

CATEGORY PREFERENCE

Software Design (Adaptive Technology)

OBJECTIVE :

To develop a Business Intelligence model which automatically crawls the web for news
articles and after detecting corporate news articles, find the company being talked about in
those articles.

IMPLEMENTATION METHODOLOGY:

Our project is divided into various modules:
Automatically extracting/crawling news articles from the web
Classifying these news articles as corporate or non-corporate
Using Natural language Processing tools to find the name of the organization which is
being talked about in the news article.
We use Nutch crawler to crawl the web for news articles and pre-process it by POS(Part-Of-
Speech) tagging and NER(Named Entity Recognition) parser to extract features for training
model. We use Support Vector Machine (LIBSVM toolkit) to train our classifier. All NLP
techniques are implemented in Java.

APPLICATION :
In this era of information overload, we require intelligent systems that can read, interpret and
analyze information themselves. Our project is one which fulfils all these parameters.
All companies need to be aware of their rivals as to what all things they are involved in,
where on the web are they being talked about etc. Our project provides them with all they
need to know about all other companies.
This project finds major applications in Business Intelligence.

JUSTIFY CHOICE OF CATEGORY:
We use Machine Learning, specifically Support vector Machines, to train our classifier which
automatically classifies corporate and non-corporate news articles. Also our system after
extracting news articles, learns itself and is intelligent enough to find the name of the
organization which is being talked about in the news article.
Thus our system evolves an intelligence of its own and has a decision making capability using
which it detects the main organization being talked about in the news. So it is fit for Adaptive
Technology.

BASIC EXPLANATION OF THE PROJECT:
With the rapid advancements in the field of information technology, the amount of
information available has increased tremendously. News articles constitute the largest
available portion of factual information about events happening in the world. Corporate news
constitutes a major chunk of these news articles. Such news is related to a wide range of
events such as acquisitions, mergers, Shares/stock performances, product launches, executive
changes, projects, legal proceedings, among others. Now this is a huge amount of information
and can be spread on the internet in a haphazard way. However, once organized in a
systematic manner, this pool of information becomes potentially a very good resource for
various tasks like analyzing the market trends of companies, helping in corporate decision
making, tracking the activities of rival companies etc.
This project finds a way of identifying corporate news from a collection of news articles and
then pairing the news with the organization/company which is being talked about in the
article. The model is capable of differentiating the main organization (which is the focus of
the news) from other organizations which find mention.

Vous aimerez peut-être aussi

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
D'Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Évaluation : 4 sur 5 étoiles
4/5 (5794)
JavaScript Commands
Document1 page
JavaScript Commands
SaiKrishnaReddy
Pas encore d'évaluation
The Little Book of Hygge: Danish Secrets to Happy Living
D'Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Évaluation : 3.5 sur 5 étoiles
3.5/5 (399)
6th Central Pay Commission Salary Calculator
Document15 pages
6th Central Pay Commission Salary Calculator
rakhonde
100% (436)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
D'Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Évaluation : 3.5 sur 5 étoiles
3.5/5 (231)
Texas Drivers Handbook - 2013 (PDF File)
Document90 pages
Texas Drivers Handbook - 2013 (PDF File)
DMV_exam_GUIDE_com
33% (3)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
D'Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Évaluation : 4 sur 5 étoiles
4/5 (894)
Introduction to Algorithms Solutions Guide
Document25 pages
Introduction to Algorithms Solutions Guide
SaiKrishnaReddy
Pas encore d'évaluation
The Yellow House: A Memoir (2019 National Book Award Winner)
D'Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Évaluation : 4 sur 5 étoiles
4/5 (98)
All Hwex
Document532 pages
All Hwex
Hai S Le
Pas encore d'évaluation
Shoe Dog: A Memoir by the Creator of Nike
D'Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Évaluation : 4.5 sur 5 étoiles
4.5/5 (537)
Musical
Document4 pages
Musical
Gwynbleidd
Pas encore d'évaluation
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
D'Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Évaluation : 4.5 sur 5 étoiles
4.5/5 (474)
Oce Arizona 350XT Brochure
Document2 pages
Oce Arizona 350XT Brochure
marcelcoopers
Pas encore d'évaluation
Never Split the Difference: Negotiating As If Your Life Depended On It
D'Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Évaluation : 4.5 sur 5 étoiles
4.5/5 (838)
Managing ICT Solutions A2
Document7 pages
Managing ICT Solutions A2
pablo Sandoval
100% (1)
Grit: The Power of Passion and Perseverance
D'Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Évaluation : 4 sur 5 étoiles
4/5 (587)
ANSYS Fluent Tutorial Guide 2020 R2 PDF
Document1 056 pages
ANSYS Fluent Tutorial Guide 2020 R2 PDF
Natalia Moreno
75% (12)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
D'Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Évaluation : 4.5 sur 5 étoiles
4.5/5 (265)
WER77DC TMP Appcompat
Document5 pages
WER77DC TMP Appcompat
Anonymous HKFMsnGAF
Pas encore d'évaluation
Yes Please
D'Everand
Yes Please
Amy Poehler
Évaluation : 4 sur 5 étoiles
4/5 (1891)
CICS For Iseries Problem Determination
Document124 pages
CICS For Iseries Problem Determination
Kuzumich
Pas encore d'évaluation
Angela's Ashes: A Memoir
D'Everand
Angela's Ashes: A Memoir
Frank McCourt
Évaluation : 4.5 sur 5 étoiles
4.5/5 (440)
Final Draft Senior Seminar Project
Document27 pages
Final Draft Senior Seminar Project
api-251794642
Pas encore d'évaluation
The Emperor of All Maladies: A Biography of Cancer
D'Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Évaluation : 4.5 sur 5 étoiles
4.5/5 (271)
Paddy Proposal v1
Document12 pages
Paddy Proposal v1
davidglits
Pas encore d'évaluation
On Fire: The (Burning) Case for a Green New Deal
D'Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Évaluation : 4 sur 5 étoiles
4/5 (73)
LTO and CPR Processing
Document2 pages
LTO and CPR Processing
verkie
100% (9)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
D'Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Évaluation : 4.5 sur 5 étoiles
4.5/5 (344)
Google Chrome Default Cookies
Document104 pages
Google Chrome Default Cookies
Andrew Shevchenko
Pas encore d'évaluation
Team of Rivals: The Political Genius of Abraham Lincoln
D'Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Évaluation : 4.5 sur 5 étoiles
4.5/5 (234)
StoneLock Pro User Manual v. 17.02.01 PDF
Document85 pages
StoneLock Pro User Manual v. 17.02.01 PDF
Antonio G
Pas encore d'évaluation
Fear: Trump in the White House
D'Everand
Fear: Trump in the White House
Bob Woodward
Évaluation : 3.5 sur 5 étoiles
3.5/5 (738)
Powerful 4K Kahuna 9600 production switcher
Document4 pages
Powerful 4K Kahuna 9600 production switcher
Naveen Gopi
Pas encore d'évaluation
The Glass Castle: A Memoir
D'Everand
The Glass Castle: A Memoir
Jeannette Walls
Évaluation : 4.5 sur 5 étoiles
4.5/5 (1712)
Class 12 CSC Practical File Mysql
Document21 pages
Class 12 CSC Practical File Mysql
S JAY ADHITYAA
100% (1)
Rise of ISIS: A Threat We Can't Ignore
D'Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Évaluation : 3.5 sur 5 étoiles
3.5/5 (137)
FxExtractor Operation
Document6 pages
FxExtractor Operation
MirevStefan
Pas encore d'évaluation
Principles: Life and Work
D'Everand
Principles: Life and Work
Ray Dalio
Évaluation : 4 sur 5 étoiles
4/5 (599)
CUSTOMIZING THE KEYBOARD
Document5 pages
CUSTOMIZING THE KEYBOARD
engelect2065
Pas encore d'évaluation
The Unwinding: An Inner History of the New America
D'Everand
The Unwinding: An Inner History of the New America
George Packer
Évaluation : 4 sur 5 étoiles
4/5 (45)
Computer Network Q - A Part-1
Document7 pages
Computer Network Q - A Part-1
Avi Dahiya
Pas encore d'évaluation
The World Is Flat 3.0: A Brief History of the Twenty-first Century
D'Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Évaluation : 3.5 sur 5 étoiles
3.5/5 (2219)
LLVM Crash Course
Document15 pages
LLVM Crash Course
Lauren Huang
Pas encore d'évaluation
Steve Jobs
D'Everand
Steve Jobs
Walter Isaacson
Évaluation : 4.5 sur 5 étoiles
4.5/5 (806)
HRMS Comparison Guide
Document160 pages
HRMS Comparison Guide
Sudheer Reddy Reddypalli
Pas encore d'évaluation
John Adams
D'Everand
John Adams
David McCullough
Évaluation : 4.5 sur 5 étoiles
4.5/5 (2409)
Multiple Choice Questions Elective - II Information Systems Management - V Knowledge Management
Document2 pages
Multiple Choice Questions Elective - II Information Systems Management - V Knowledge Management
Shubhada Amane
Pas encore d'évaluation
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
D'Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Évaluation : 4 sur 5 étoiles
4/5 (1090)
Crew Visas (C1D) : Step 1: Prepare
Document10 pages
Crew Visas (C1D) : Step 1: Prepare
LýMinhTiến
Pas encore d'évaluation
Bad Feminist: Essays
D'Everand
Bad Feminist: Essays
Roxane Gay
Évaluation : 4 sur 5 étoiles
4/5 (1015)
RCI-1550 HRT Lattice Boom System Instruction Manual: MAN-1075 Rev D
Document85 pages
RCI-1550 HRT Lattice Boom System Instruction Manual: MAN-1075 Rev D
Gaetano Frulio
Pas encore d'évaluation
The Outsider: A Novel
D'Everand
The Outsider: A Novel
Stephen King
Évaluation : 4 sur 5 étoiles
4/5 (1839)
CIS552 Indexing and Hashing 1
Document56 pages
CIS552 Indexing and Hashing 1
Vinay Varma
Pas encore d'évaluation
Brooklyn: A Novel
D'Everand
Brooklyn: A Novel
Colm Tóibín
Évaluation : 3.5 sur 5 étoiles
3.5/5 (1937)
InterSystems SDA
Document52 pages
InterSystems SDA
berhanu92
Pas encore d'évaluation
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
D'Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Évaluation : 4.5 sur 5 étoiles
4.5/5 (119)
CMP507 Computer Network
Document227 pages
CMP507 Computer Network
Sufiyan Mogal
0% (1)
A Man Called Ove: A Novel
D'Everand
A Man Called Ove: A Novel
Fredrik Backman
Évaluation : 4.5 sur 5 étoiles
4.5/5 (4609)
Carbon Coder User Guide
Document97 pages
Carbon Coder User Guide
Veres András
Pas encore d'évaluation
The Light Between Oceans: A Novel
D'Everand
The Light Between Oceans: A Novel
M.L. Stedman
Évaluation : 4.5 sur 5 étoiles
4.5/5 (789)
Unit 5 Machine Learning With PU Solution
Document68 pages
Unit 5 Machine Learning With PU Solution
Kavi Raj Awasthi
Pas encore d'évaluation
The Woman in Cabin 10
D'Everand
The Woman in Cabin 10
Ruth Ware
Évaluation : 3.5 sur 5 étoiles
3.5/5 (2322)
Build cross-platform mobile apps with React Native
Document31 pages
Build cross-platform mobile apps with React Native
dvirus2012
Pas encore d'évaluation
Manhattan Beach: A Novel
D'Everand
Manhattan Beach: A Novel
Jennifer Egan
Évaluation : 3.5 sur 5 étoiles
3.5/5 (792)
User Manual Topspin ts35 PDF
Document260 pages
User Manual Topspin ts35 PDF
Srinivas Penumutchu
Pas encore d'évaluation
The Perks of Being a Wallflower
D'Everand
The Perks of Being a Wallflower
Stephen Chbosky
Évaluation : 4.5 sur 5 étoiles
4.5/5 (2099)
Introduction To Image Processing Using Matlab
Document85 pages
Introduction To Image Processing Using Matlab
Akankhya Behera
Pas encore d'évaluation
Wolf Hall: A Novel
D'Everand
Wolf Hall: A Novel
Hilary Mantel
Évaluation : 4 sur 5 étoiles
4/5 (3811)
Script Mafia
Document2 pages
Script Mafia
Hassan Fethi
Pas encore d'évaluation
Little Women
D'Everand
Little Women
Louisa May Alcott
Évaluation : 4 sur 5 étoiles
4/5 (104)
The Art of Racing in the Rain: A Novel
D'Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Évaluation : 4 sur 5 étoiles
4/5 (4200)
Sing, Unburied, Sing: A Novel
D'Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Évaluation : 4 sur 5 étoiles
4/5 (1103)
A Tree Grows in Brooklyn
D'Everand
A Tree Grows in Brooklyn
Betty Smith
Évaluation : 4.5 sur 5 étoiles
4.5/5 (1929)
The Constant Gardener: A Novel
D'Everand
The Constant Gardener: A Novel
John le Carre
Évaluation : 3.5 sur 5 étoiles
3.5/5 (104)
Her Body and Other Parties: Stories
D'Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Évaluation : 4 sur 5 étoiles
4/5 (821)