Vous êtes sur la page 1sur 24

INTRODUCTION

 The name of our project is www.postoutopinion.com


“a business solution providers”
 This is a full stack project having four domains.
 Front end website with customer demand.
 Backend which supports database flexibility. ( Hadoop, hive, pig, sqoop )
 Third and most important data analysis part. With the help of web scraping
and data analysis.
Objective

 Project is based on data analysis for the development of college or any


organization through a full developed website.
 We are providing solutions to websites , colleges and other organizations by
comparing their data with the other developed organizations.
 After analysing the data of the college entered by the students and faculties
and comparing them with other colleges we can conclude the basic answers
easily.
Problem Domain
 There is a huge communication gap between the students and the college
development team.
They are unaware of students demand and what their own students think
about them.
Whether they are satisfied with the teaching faculties or not and many
more questions.
Solution Domain
The best solution is a place where there data can be used for their
betterment which is provided by our website.
After analysing the data of the college entered by the students and
faculties and comparing them with other colleges we can conclude these
basic answers easily.
There are some question which will be asked from the students and on
the basis of answers we will draw conclusion and provide answers in the
form of pie chart and graphs .
The data will be in G.Bs. and this will require bigdata technologies like
Hadoop , hive, pig , sqoop, flume etc.
Domains

Predictive analytics Distributed file stores


NoSQL databases  Data virtualization
Search and knowledge
Data integration
discovery
Data preparation
Stream analytics
In-memory data Data quality
fabric
THE SOLUTION WILL BE PROVIDED
APPLICATION DOMAIN
The overall data will be around 50000 entries , which is to be
analyse first before coming to any solution .
 Our project is used for colleges basically it improves the
colleges with the help of feedback.
EXPECTED OUTCOME
 Solution of the business using the data provided by the opinion of users.
 A well devoleped website for a data analysis.
 With college data analysis questions like mentioned below can be answered
easily.
 Why we are behind s.g.s.i.t.s or iit indore ?
Are girls safe in college ?
What do students think about their college ?
Whether students are satisfied with teachers ?
Is there any issue of ragging in the college ?
Data Requirements
 Data requirements are prescribed directives or consensual agreements that define the content
and/or structure that constitute high quality data instances and values.

 Data requirements are required as a prerequisite to measure data quality.

 There are two typical challenges when we gather data requirements. First, we capture and
document the requirements in a way the person capturing them understands.

 Second, we explore and model the required data structures solely based on the system of
record data structures or the users reporting requirements
Functional Requirements
 Defines the external behavior of the system.

 How the system interacts with its environment

 How it responds to input

 What output to generate

 How to behave under certain conditions

 In other words, it illustrates what the system does.


Non Functional Requirements
 Defines the quality attributes or the constraints of the system.

 Some examples of quality attributes include security, performance, and availability.

 Technology Constraint – For example, the system must be implemented using Java technology.

 Process Constraint – A process constraint could be that the new system must be developed with the
RUP methodology due to its architectural complexity.

 The response time for a search transaction under a peak load of 10,000 users must not exceed 2 seconds.
Use Case Diagram
Activity Diagram
Data Flow Diagram
Data Flow Diagram Level 1
Class Diagram
State Diagram
Sequence Diagram
Deployment Diagram
Limitations:-
 The result and conclusion drawn upon by using analysis are not very accurate.

 College data analysis is not a complete solution to any college issue as there are many dominant
variables between research and response.

 Inappropriate knowledge can lead to misapprehension of questions to be asked for data collection..

 Improper interpretation of Data.

 It is interesting and shocking to state that data analysis does not solve any problem directly but cam assist to
solve it.

 When a huge volume of continuously generated data exists, the veracity issue arises to address the
uncertainty, validity of the data.
Future Enhancement:-

 The future value of the college data analysis will be achieved by harnessing the collaboration between
the combined strength of students and the tools.

 Potential improvement can be made to our data collection and analysis method.

 Future research can be done with possible improvement such as more refined data.
REFERENCES
 Internet ( websites like www.stackoverflow.com ).
 A new website called survey monkey.
 Inspired by amazon working system.
 Bigdata experts .
 www.tutorialpoints.com for html and css for website building.
www.draw.io.com for UML diagrams.
www.creatly.com for UML diagrams.

Vous aimerez peut-être aussi