CONTENTS
ACKNOWLEDGEMENT
SYNOPSIS
1. INTRODUCTION
1.1 ORGANIZATION PROFILE
2. SYSTEM CONFIGURATION
2.1 HARDWARE CONFIGURATION
2.2 SOFTWARE CONFIGURATION
2.3 SOFTWARE FEATURES
3. SYSTEM STUDY
3.1 EXISTING SYSTEM
3.2 LIMITATIONS OF EXISTING SYSTEM
3.3 PROPOSED SYSTEM
3.4 ADVANTAGES OF PROPOSED SYSTEM
4. SYSTEM DESIGN
4.1 DATA FLOW DIAGRAM
4.2 E-R DIAGRAM
4.3 DATABASE DESIGN
4.4 INPUT DESIGN
4.5 OUTPUT DESIGN
5. SYSTEM IMPLEMENTATION AND TESTING
5.1 SYSTEM IMPLEMENTATION
5.2 SYSTEM TESTING
6. CONCLUSION
8. BIBLIOGRAPHY
APPENDICES
i. TABLES
ii. FORMS
SYNOPSIS
The project entitled “Web Mining Browser” is a VB.NET Windows application backed by SQL Server
2005, developed for Creative Solutions, Coimbatore.
The basic aim of this project is to create a browser application in VB.NET that works both
online and offline and uses per-user personalization data. Once the Internet connection is lost, the
user can continue to browse stored pages without interruption. Stored pages are updated
automatically by the auto intelligent class. The user can also read RSS feeds with the built-in RSS
reader. RSS feeds are likewise stored locally and updated automatically, so they can be read offline.
The browser also detects the URLs in a page and stores the pages. The system is
personalizable: the browser can be optimized for different users. Once a user name is provided, the
browser authenticates the user and loads that user's personalized settings.
The system consists of the following modules: User Registration, Link Extractor, Page Storage,
RSS Reader, and Feed Storage. User Registration registers a new user with the system and assigns
each user a unique user ID. The Link Extractor module extracts the hyperlinks available on an
HTML page and displays them in a corner of the window. Pages are stored in the database in binary
format. RSS feeds are detected automatically for each website and read automatically. The Feed
Storage module stores RSS feeds in the database for offline use.
Offline access:
Offline pages can be accessed without interruption, since the stored pages are
updated automatically by the auto intelligent class. The user can also read RSS feeds with the
built-in RSS reader. RSS feeds are stored locally and updated automatically, so the user can also
read the feeds offline.
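The storage side of offline access can be sketched as follows. This is only an illustration, not the project's code: it is written in Python rather than VB.NET, it uses an in-memory SQLite database as a stand-in for SQL Server 2005, and the table name `webpage`, the SHA-256 digest column and the function names are all assumptions made for the example. The digest is one possible way to decide whether a stored page needs updating.

```python
import hashlib
import sqlite3


def open_store(path=":memory:"):
    """Create the (hypothetical) page table if needed and return the connection."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS webpage ("
        " url TEXT PRIMARY KEY,"
        " digest TEXT NOT NULL,"
        " body BLOB NOT NULL)"
    )
    return conn


def save_page(conn, url, body):
    """Store body (bytes) for url; return True if the stored copy changed."""
    digest = hashlib.sha256(body).hexdigest()
    row = conn.execute(
        "SELECT digest FROM webpage WHERE url = ?", (url,)
    ).fetchone()
    if row is not None and row[0] == digest:
        return False  # same content as before, keep the stored copy
    conn.execute(
        "INSERT OR REPLACE INTO webpage (url, digest, body) VALUES (?, ?, ?)",
        (url, digest, body),
    )
    conn.commit()
    return True


def load_page(conn, url):
    """Return the stored bytes for url, or None if the page was never saved."""
    row = conn.execute(
        "SELECT body FROM webpage WHERE url = ?", (url,)
    ).fetchone()
    return None if row is None else bytes(row[0])
```

Calling `save_page` every time a page is fetched online keeps the stored copy current, and `load_page` serves the same bytes back when the connection is unavailable.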
Personalization:
The browser also detects the URLs in a page and stores the pages. The system is
personalizable: the browser can be optimized for different users. Once a user name is provided, the
browser authenticates the user and loads that user's personalized settings.
The system consists of the following modules: User Registration, Link Extractor, Page Storage,
RSS Reader, and Feed Storage. User Registration registers a new user with the system and assigns
each user a unique user ID. The Link Extractor module extracts the hyperlinks available on an
HTML page and displays them in a corner of the window. Pages are stored in the database in binary format.
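The link extraction step described above can be illustrated with a short sketch. It is written in Python rather than VB.NET and uses only the standard library; the class name `LinkExtractor` and the helper `extract_links` are hypothetical, and resolving relative links against the page URL is an assumption about what the module would need in order to store the linked pages.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect the href of every <a> tag, resolved against the page URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                absolute = urljoin(self.base_url, value)
                if absolute not in self.links:  # skip repeated links
                    self.links.append(absolute)


def extract_links(html, base_url):
    """Return the distinct hyperlinks found in html, in document order."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.links
```

For a page at `http://example.com/index.html` containing `<a href="/about">`, `extract_links` returns `http://example.com/about`, which is the form needed to fetch and store the linked page.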
Feed reader:
RSS feeds are detected automatically for each website and read automatically. This
module stores RSS feeds in the database for offline use.
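A rough sketch of feed detection and reading follows, in Python rather than VB.NET. It assumes that a site advertises its feed through a `<link rel="alternate" type="application/rss+xml">` tag in the page head (a common convention, though the report does not say how detection is done) and that the feed is RSS 2.0. The names `FeedFinder`, `find_feeds` and `parse_rss` are hypothetical.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser


class FeedFinder(HTMLParser):
    """Collect feed URLs advertised by <link rel="alternate"> tags."""

    def __init__(self):
        super().__init__()
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        is_alternate = (attr.get("rel") or "").lower() == "alternate"
        is_rss = (attr.get("type") or "").lower() == "application/rss+xml"
        if is_alternate and is_rss and attr.get("href"):
            self.feeds.append(attr["href"])


def find_feeds(html):
    """Return the feed URLs declared in an HTML page."""
    finder = FeedFinder()
    finder.feed(html)
    finder.close()
    return finder.feeds


def parse_rss(xml_text):
    """Return one dict per <item> with its title, link and description."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": (item.findtext("title") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
            "description": (item.findtext("description") or "").strip(),
        })
    return items
```

The list returned by `parse_rss` is what a reader would display, and the raw feed text is what would be stored for offline reading.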
User registration:
User registration is used to register a new user with the system. A unique user ID and a password
are assigned to each user.
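Registration and authentication could look roughly like the sketch below. It is in Python with an in-memory SQLite database standing in for SQL Server 2005, and every name in it (`users`, `register`, `authenticate`) is hypothetical. Storing a salted PBKDF2 hash instead of the password itself is an assumption of this example; the report does not describe how passwords are kept.

```python
import hashlib
import hmac
import os
import sqlite3


def open_users(path=":memory:"):
    """Create the (hypothetical) users table and return the connection."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS users ("
        " user_id INTEGER PRIMARY KEY AUTOINCREMENT,"
        " user_name TEXT NOT NULL UNIQUE,"
        " salt BLOB NOT NULL,"
        " pw_hash BLOB NOT NULL)"
    )
    return conn


def _hash(password, salt):
    # Salted, iterated hash so the plain password is never stored.
    return hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, 100_000)


def register(conn, user_name, password):
    """Insert a new user and return the unique user ID the database assigned."""
    salt = os.urandom(16)
    cur = conn.execute(
        "INSERT INTO users (user_name, salt, pw_hash) VALUES (?, ?, ?)",
        (user_name, salt, _hash(password, salt)),
    )
    conn.commit()
    return cur.lastrowid


def authenticate(conn, user_name, password):
    """Return the user ID if the password matches, otherwise None."""
    row = conn.execute(
        "SELECT user_id, salt, pw_hash FROM users WHERE user_name = ?",
        (user_name,),
    ).fetchone()
    if row is None:
        return None
    user_id, salt, stored = row
    return user_id if hmac.compare_digest(stored, _hash(password, salt)) else None
```

The user ID returned by `authenticate` is the key under which personalized settings and stored pages would be looked up.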
Quick launch:
Predefined links to the most used websites are provided as separate buttons. Clicking a button
opens that page directly, which saves the time of typing the address in the address bar.
1. INTRODUCTION
This chapter presents a short profile of the organization and a brief introduction to the
system.
Solutions from CreativeKSolutions are robust and scalable, and integrate easily with a diverse range
of products and technologies. CreativeKSolutions' expertise spans the entire gamut of application
and custom development. Its wide range of technological expertise, application knowledge and
consulting experience enables it to develop and integrate robust and scalable e-business solutions
that keep the end customer's requirements in mind. The software development process, supported
by a proven onsite-offshore development methodology and quality management system,
shortens application development timeframes, providing significant business benefits to customers.
SYSTEM CONFIGURATION
Hardware configuration
Software configuration
SOFTWARE FEATURES
Front End: VB.NET
VB.NET is a simple, modern, object-oriented language. It is the successor to Visual Basic and
runs on the .NET Framework.
It aims to combine the high productivity of Visual Basic with the capabilities of the .NET Framework.
Console applications, Windows applications, and Web applications can all be developed using
VB.NET.
In VB.NET Microsoft has taken care of common C++ problems such as manual memory management
and pointer errors.
It supports garbage collection and automatic memory management.
Value types (primitive types) are initialized to zero and reference types to Nothing.
VB.NET does not expose pointers or unsafe code blocks; existing unmanaged code is reached through
COM interop and platform invoke.
Components written in other managed-code languages can be used directly in
VB.NET.
Back End: SQL Server 2005
A SQL Server database is a collection of many objects, such as tables, views, stored procedures
and constraints.
A database is owned by a single user account but can contain objects owned by other
users.
Each database has its own set of system tables, which catalog the definition of the
database.
Each database maintains its own set of user accounts and security.
The database is the primary unit of recovery and maintains logical consistency among
the objects in it.
Each database has its own transaction log and manages the transactions within the
database.
A database can participate in two-phase commit transactions with other SQL Server databases.
A SQL Server database can range in size from 1 MB up to a theoretical limit well beyond 1 TB (hundreds of thousands of terabytes in SQL Server 2005).
A SQL Server database can grow and shrink, either automatically or by command.
A database can have its objects joined in queries with objects from other databases in the
same SQL Server instance.
A SQL Server database is conceptually similar to, but richer than, the ANSI SQL-schema concept.
Network administrators can work more efficiently because many of the most common
computer management tasks are automated and streamlined in Windows 2000 Professional. It
offers increased compatibility with different types of network and with a wide array of legacy
hardware and software.
Basic features
• It provides improved driver support.
• Active Desktop allows the workspace to be customized, and the address bar helps to connect to
the Internet from any window.
• A variety of tools that help to communicate with people and other computers are
available.
• The communication tools are used to send e-mail, handle phone calls, send a fax, conduct a
meeting by videoconference, etc.
Drawbacks
The existing browser only browses websites and keeps them in cache memory.
Once the browser history is erased, all of the history and the cached web pages are lost.
Only URL addresses are stored as history.
It is not personalized; that is, no data specific to a particular user is considered while
browsing.
The proposed system contains an RSS (Really Simple Syndication) feed reader. The RSS reader
reads the RSS feed of a website automatically. It retrieves the feed data and displays it in a user
interface that is easier to read.
Benefits
Personalized Browser settings and page storage.
Automatically updates the stored page if any change occurs on the web page.
Allows the user to browse offline without an Internet connection.
Allows users to read RSS feeds from the browser itself.
RSS feeds are updated automatically and are available for offline reading.
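The first three benefits amount to a simple fallback rule: fetch the page when a connection is available, refresh the stored copy if the content has changed, and serve the stored copy when the fetch fails. The sketch below shows that rule in Python; it is an illustration only. The function `browse`, the injectable `fetch` callable and the plain dictionary used as the store are assumptions, and treating any `OSError` as "no connection" is a simplification.

```python
def browse(url, fetch, cache):
    """Return (body, source) for url.

    fetch is any callable that returns the page bytes or raises OSError
    when the network is unavailable; cache is a dict-like page store.
    """
    try:
        body = fetch(url)
    except OSError:
        if url in cache:
            return cache[url], "offline"  # serve the stored copy
        raise  # nothing stored for this page, so the error is passed on
    if cache.get(url) != body:
        cache[url] = body  # page is new or has changed, refresh the store
    return body, "online"
```

Passing the fetch function in as a parameter keeps the rule easy to test without a real network connection.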
SYSTEM DESIGN
System design is a solution, a “how to” approach to the creation of a new system. It provides the
understanding and procedural details necessary for implementing the system recommended in the
feasibility study. A design goes through logical and physical stages of development. Design is a
creative process that involves working with the unknown new system, rather than analyzing the
existing system. In analysis, by contrast, it is possible to produce a correct model of the existing system.
Table design
A database is a collection of inter-related data with minimum redundancy, designed to serve the
user quickly and efficiently. The data are stored in tables. Data form the basis of an
information system: without data there is no system, but the data must be provided in the right form
for input, and the information produced must be in a format acceptable to the user. The tables
used are USER TABLE, FEED TABLE, and WEBPAGE TABLE.
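The report names the three tables but this section does not list their columns, so the schema below is a guess made purely for illustration. Every column name and type is an assumption, and SQLite (driven from Python) stands in for SQL Server 2005. The binary columns reflect the statement that pages are stored in binary format, and the `user_id` foreign keys reflect per-user personalization.

```python
import sqlite3

# Hypothetical columns; only the three table names come from the report.
SCHEMA = """
CREATE TABLE user_table (
    user_id       INTEGER PRIMARY KEY,
    user_name     TEXT NOT NULL UNIQUE,
    password_hash TEXT NOT NULL
);
CREATE TABLE feed_table (
    feed_id  INTEGER PRIMARY KEY,
    user_id  INTEGER NOT NULL REFERENCES user_table(user_id),
    feed_url TEXT NOT NULL,
    content  BLOB
);
CREATE TABLE webpage_table (
    page_id INTEGER PRIMARY KEY,
    user_id INTEGER NOT NULL REFERENCES user_table(user_id),
    url     TEXT NOT NULL,
    content BLOB
);
"""


def create_schema(conn):
    """Create the three tables and return their names in sorted order."""
    conn.executescript(SCHEMA)
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [row[0] for row in rows]
```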
A Data Flow Diagram (DFD) is a modeling tool that pictures a system as a network of functional
processes connected to one another by pipelines of data. DFDs are also widely used to represent
external and top-level design specifications. The DFD shows the interface between the system and
external terminators. A Data Flow Diagram is also called a “Bubble Chart”. A bubble represents a
process, a line represents a data flow and a rectangle represents an entity.
An Entity Relationship Diagram (ERD) is a model that describes the stored data layout of a system at
a high level of abstraction. The ER diagram makes it possible to examine and highlight the data
structures and the relationships between the data stores in the DFDs. It provides the information
needed to access the database records efficiently.
SYSTEM DESIGN
FUNDAMENTAL DESIGN CONCEPTS
System design sits at the technical kernel of software engineering and is applied
regardless of the software process model that is used. Beginning once the software requirements
have been analyzed and specified, design is the first of the technical activities (design, code
generation and test) that are required to build and verify the software. Each activity transforms
information in a manner that ultimately results in validated computer software.
Three main characteristics serve as a guide for the evaluation of a good design:
• The design must implement all of explicit requirements contained in the analysis model, and it
must accommodate all of the implicit requirements desired by the customer.
• The design must be readable, understandable guide for those who generate code and for those
who test and subsequently support the software.
• The design should provide a complete picture of software, addressing the data, its functional
and behavioral domains from the implementation perspective.
Input design is the process of converting the user-oriented description of the computer
based business information into program-oriented specification. The goal of designing input data is
to make the automation as easy and free from errors as possible.
Logical Design of the system is performed where its features are described, procedures
that meet the system requirements are formed and a detailed specification of the new system is
provided.
External Design of software involves conceiving, planning and specifying the externally
observable characteristics of the software product. The external design begins in the analysis phase
and continues till the design phase.
During the design phase the following designs were produced. Each was worked out
separately, keeping in mind all the requirements, constraints and conditions, and a
step-by-step process was followed.
Process Design is the design of the processing to be done; it is the design work that leads
directly to coding. The conditions and constraints given in the system are considered here,
and the design is carried out accordingly.
Output is the most important and direct source of information to the user.
Output design is an ongoing activity during the study phase. Its objective is to
define the contents and format of all documents and reports in an attractive and useful form.
After the requirement analysis was completed, the next step was the
Design and Development phase, in which the project is actually built.
The project was developed using the Linear Sequential Model, also known as the Classic Life Cycle
or Waterfall Model. This is a sequential approach to software development that begins at the system
level and progresses through analysis, design, coding and testing.
Code Generation
The design must be translated into a machine-readable form. The code generation step
performs this task. If design is performed in a detailed manner, code generation can be accomplished
mechanistically.
After the design phase was completed, the code was written in the Visual Basic .NET
environment and SQL Server 2005 was used to create the database. The application was
connected to the database server through ADO.NET.
The purpose of a code is to facilitate the identification and retrieval of items of information.
Codes are built with mutually exclusive features. They are used to give operational instructions
and other information. Codes also show the interrelationships among different items. Codes are used
for identifying, accessing, sorting and matching records. A code ensures that only one value,
with a single meaning, is applied to a given entity or attribute, however it is described.
Codes can also be designed so that they are easily understood and applied by the user.
The coding standards used in the project are as follows:
1. Every variable is named so that the name represents the flow or function it serves.
2. Every function is named so that the name represents the task it performs.
SYSTEM IMPLEMENTATION
A software application is generally implemented after passing through the complete life cycle
of a project. Life cycle processes such as requirement analysis, design,
verification and testing, followed finally by implementation, result in successful project
management. This software, which is a Windows-based application, has been
successfully implemented after passing through the life cycle processes mentioned above.
As the software is to be implemented in a high-standard industrial sector, factors
such as application environment, user management, security, reliability and performance were
treated as key factors throughout the design phase. These factors were analyzed step by step and the
positive as well as negative outcomes were noted before the final implementation.
Security and authentication are maintained at both the user level and the management
level. The data is stored in SQL Server 2005 as the RDBMS, which is highly reliable and simple to
use. User-level security is managed with passwords and sessions, which ensures
that all transactions are made securely.
The application's validations take into account the entry levels available in
the various modules. Restrictions such as number formatting, date formatting and confirmations for
both save and update operations ensure that correct data is fed into the database. Thus all the
aspects were charted out and the complete project study was implemented successfully for the end
users.
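The number and date restrictions mentioned above can be sketched as a small validation routine. This is a Python illustration rather than the project's VB.NET code. The field names `age` and `joined` and the DD/MM/YYYY date format are invented for the example, since the report does not list the actual form fields.

```python
from datetime import datetime


def validate_entry(fields):
    """Return a dict of field name -> error message; empty means valid."""
    errors = {}

    # Number formatting: digits only, no sign, no decimal point.
    age = fields.get("age", "").strip()
    if not (age.isascii() and age.isdigit()):
        errors["age"] = "must be a whole number"

    # Date formatting: strptime also rejects impossible dates such as 31/02.
    try:
        datetime.strptime(fields.get("joined", "").strip(), "%d/%m/%Y")
    except ValueError:
        errors["joined"] = "must be a date in DD/MM/YYYY form"

    return errors
```

A form would call `validate_entry` before saving or updating and ask for confirmation only when the returned dictionary is empty.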
SYSTEM TESTING
Software testing is a critical element of software quality assurance and represents the
ultimate review of specification, design and code generation. Once the source code has been
generated, software must be tested to uncover as many errors as possible before delivery to the
customer. In order to find the highest possible number of errors, tests must be conducted
systematically and test cases must be designed using disciplined techniques.
Types of testing
White box Testing
White box testing, sometimes called glass box testing, is a test case design method that
uses the control structure of the procedural design to derive test cases.
Using white box testing methods, the software engineer can derive test cases that
guarantee that all independent paths within a module have been exercised at least once, exercise
all logical decisions on their true and false sides, execute all loops at their boundaries and within
their operational bounds, and exercise internal data structures to ensure their validity.
“Logic errors and incorrect assumptions are inversely proportional to the probability that a
program path will be executed.”
The logical flow of a program is sometimes counterintuitive, meaning that unconscious
assumptions about the flow of control and data may lead to design errors that are uncovered only
once path testing commences.
“Typographical errors are random.”
When a program is translated into programming language source code, it is likely that
some typing errors will occur. Many will be uncovered by syntax and type checking mechanisms,
but others may go undetected until testing begins. It is as likely that a typo will exist on an obscure
logical path as on a mainstream path.
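As an illustration of the white box criteria listed above, the sketch below takes a small function with one loop and two decisions and derives test cases directly from its control structure. The function `first_n_http_links` is hypothetical and written in Python; it is not taken from the project.

```python
def first_n_http_links(links, limit):
    """Return up to `limit` links that start with http:// or https://."""
    kept = []
    for link in links:                                # loop
        if len(kept) >= limit:                        # decision 1
            break
        if link.startswith(("http://", "https://")):  # decision 2
            kept.append(link)
    return kept


def run_white_box_cases():
    """Exercise each decision both ways and the loop at its boundaries."""
    # Loop executes zero times.
    assert first_n_http_links([], 3) == []
    # Loop executes once; decision 2 true.
    assert first_n_http_links(["http://a"], 3) == ["http://a"]
    # Decision 2 false.
    assert first_n_http_links(["ftp://a"], 3) == []
    # Several passes; decision 1 becomes true on the third pass.
    many = ["http://a", "https://b", "http://c"]
    assert first_n_http_links(many, 2) == ["http://a", "https://b"]
    # Boundary: limit of zero makes decision 1 true on the first pass.
    assert first_n_http_links(["http://a"], 0) == []
    return 5  # number of cases run
```

Each case is chosen from the code's structure rather than from its specification, which is the difference between white box and black box test design.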
Unit testing
Static analysis is used to investigate the structural properties of source code. Dynamic
test cases are used to investigate the behavior of source code by executing the program on the test
data. This testing was carried out during programming stage itself.
After each field in a module had been tested, the modules of the project were tested
separately. Unit testing focuses verification effort on the smallest unit of software design, here
the individual field. This is known as field testing.
6. CONCLUSION
The implementation and testing were done in a step-by-step process. Each module
was developed and tested individually to obtain the required output in the desired
form. The project is full-fledged and user-friendly. The system has greatly reduced clerical
overhead and drastically reduced the time taken. The system satisfies all the
requirements of the user. To the best of my knowledge, the software meets its stated goals.
The software developed has been designed and run to satisfy the requirements and needs of
the organization as well as the end users. The system reduces the manual work of maintenance of the
records. It has also resulted in quick retrieval and reference of required information, which is vital to
The entire system is documented and can be easily understood by the end users. The forms are
very user-friendly and easy to handle, even for beginners, with very little effort and guidance.
• Registered users could be kept informed about new website integrations through e-mail.
• The data screens can be upgraded and menus can easily be added when required. User details
can be added to the forms when new data becomes necessary.
• The system has much scope for the future, and it can be developed further to add more features
that satisfy users' requests.
9. BIBLIOGRAPHY
Books Referred:
4. Karli Watson, Richard Anderson , “Professional ASP.NET 1.1” , 2004 Edition, Wrox
Publications
Websites:
1. www.msdn.microsoft.com
2. www.vbcity.com
3. www.vbdotnetheaven.com
4. www.codeguru.com
Appendix
Data flow diagram:
[Figure not recoverable. Surviving labels: "Enter user name & password", "Admin / user", "Invalid".]