Vous êtes sur la page 1sur 18

TABLE OF CONTENTS

CHAPTERS PAGE NO
1. COMPANY PROFILE 1
1.1 About Abiyaantrix and Sapience Academy 1
1.2 About the founder 1
1.3 Organisation structure 1
1.4 Services offered of the company 2
1.5 Working process of the company 2
1.6 Design capabilities 2
1.7 People working in the organization 2
2. INTRODUCTION 4
2.1 Objective 4
2.2 Problem statement 4
2.3 Proposed solution 4
2.4 Organisation of the report 4
3. TASK PERFORMED 5
4. REFLECTION NOTES 6
4.1 Experience assessments 6
4.2 Technical outcomes 6
4.3 Personality development 6
4.4 Time management 7
4.5 Skills 7
SNAPSHOTS 8
5. CONCLUSION 14
References 15
LIST OF FIGURES
FIGURE NUMBER DESCRIPTION PAGE NUMBER

5.1 The initial data set 7

5.2 Description of crime with 7


date and time

5.3 Head count of each type of 8


crime

5.4 White grid plotting of crimes 8

5.5 Graph of Arrests (Yearly and 9


monthly)

5.6 Graph of Arrests (Yearly and 9


monthly)

5.7 Graph of Domestic violence 10


(Yearly and monthly)

5.8 Graph of Domestic violence 10


(Weekly and daily)

5.9 Top 5 monthly crimes (2014 11


and 2015)

5.10 Top 5 weekly crimes (2014 11


and 2015)

5.11 Top 5 daily crimes 12

2014 and 2015


Chicago Crime Dataset

ABSTRACT

Data science is a multi-disciplinary field that uses scientific methods, processes,


algorithms and systems to extract knowledge and insights from structured and unstructured
data. Data science is the same concept as data mining and big data: "use the most powerful
hardware, the most powerful programming systems, and the most efficient algorithms to solve
problems". Data mining applications are utilized in many banking sectors for client
segmentation and productivity, credit scores and authorization, predicting payment default,
advertising, detecting fake transactions, etc. A general idea about the model of Data Mining
techniques and diverse crimes is presented. It also provides an inclusive survey of competent
and valuable techniques on data mining for crime data analysis. The objective of crime data
mining is to recognize patterns in criminal manners in order to predict crime, anticipate
criminal activity and prevent it. We implement a novel data mining technique like K-Means
and Influenced Association Classifier for investigating the crime data sets and sorts out the
accessible problems. The K-Means algorithm is being utilized for unsupervised learning
cluster within influenced Association Classification. K-means selects the initial centroids so
that the classifier can mine the record. The collective knowledge of K-Means and Influenced
Association Classifier tends certainly to afford an enhanced, incorporated, and precise result
over the crime prediction in the banking sectors. Our law enforcement organizations require
to be adequately outfitted to defeat and prevent the crime.

Dept of CSE, VVCE


Chicago Crime Dataset

Chapter 1

COMPANY PROFILE

1.1 About Company


Abiyaantrix and SAPience Academy and Management Solutions are rendering services to
educational institutions, corporates, government organizations and NGOs. Abiyaantrix and SAPience
have experts in academics with many years of teaching and corporate training exposure. They have
trained over 1,20,000 corporate professionals, trainers, teaching professionals, students, and have
contributed immensely to the success of our clients. They have operations in Karnataka, Tamil Nadu
and Maharashtra. Their facilitators have travelled far and wide, and their clientele consists of Indian
and International (USA, Europe, Singapore and Dubai) corporates/academic institutions. Their success
is not in the number of projects we undertake or our annual turnover; their success lies in the change
we bring about in the organizations and educational institutions they are associated with. They focus
on quality, not just quantity. Their contentment lies in enhancing knowledge, skills, attitudes and
molding individuals. Today's highly competitive world requires experienced, skilled and hardworking
professionals to get an edge in the "rat race." It’s their privilege to partner with us in this endeavor.

1.2 About the Founder


The idea was initiated by Shashi Kiran who is the backbone of Abiyaantrix and Sapience
Academy and his better half Anjana. Both of them had this passion to empower people and make a
difference in their lives through Training and Facilitation. Mr. Harshith Ramesh and Mr. Srikrishna S
Kashyap is the director and trainer respectively of Abiyaantrix (Abiyaanprashikshana Tech Solutions
(P) Ltd). His skills include Marketing Communications, HR Consulting, team management, project
management, public speaking, talent acquisition and business analysis. He is proficient in Java, .NET,
HTML, PHP, C++, Cloud Computing and Microsoft Office.

1.3 Organization Sector


Abiyaantrix Private Limited is a private company. It is classified as Non-Government
company and is registered at Registrar of Companies, Bangalore. It is a business process
outsourcing and offshore outsourcing solution with the local consulting presence that integrates
business strategy with execution.

Dept of CSE, VVCE 1


Chicago Crime Dataset

1.4 Services offered by the Company


They conduct Training programs for Engineering Students/Professionals/Any graduates or Post
graduates in their respective fields which will help them in getting better job opportunities.
Technical Training Programs include (to name a few) MEAN STACK, Web Development,
Machine Learning, Embedded Systems, Nano Technology, RF & Microwaves, Computer
Networking, VLSI, MATLAB, Arduino and Raspberry Pi, Switch Gears, Manufacturing
Technology, Design Engineering, Automobile, Thermal and Fluid Dynamics, Tool Designing and
much more. Abiyaantrix Tech Solutions Private Limited is involved in perfectly understanding
the client’s scenario and carefully ensure that their interests are well protected. They also bring in
the best people with the right expertise to help solve their client’s problems and constantly provide
them support. They make sure the client reaches the goal.

EARLY-> EFFICIENT ->ECONIMIC

1.5 Working Process of the Company


Abiyaantrix-SAPience follows dynamic, participant-centered, multi-sensory learning
format that has been proven to dramatically accelerate learners' acquisition of knowledge and
skills. In Abiyaantrix-SAPience, every trainer/facilitator undergoes a Train-the-Trainer (TTT)
Programme followed by a meticulous three-way process of internal certification by the Master
Trainers: Observations of the sessions conducted by experienced trainers, Co-facilitation with
experienced trainers and Solo-facilitation with rigorous auditing. Their thinking-outside-the-box
training meets the accelerated demands of today's learners where information must be effectively
prioritized and quickly assimilated and integrated to improve performance. Abiyaantrix and
SAPience customize training content and training to cater to the diverse and specific needs of the
clients across domains.

1.6 Design capabilities


Abiyaantrix is an established training center having latest computers with latest software.
These hi-tech facilities are interconnected through internal networking and high-speed internet.

1.7 People working in the Organisation


They are a Team of Professionals working in the field of IT and Soft Skills with a
collective experience of 17 years who have worked with Universities, Academic Institutes,
Corporates and Schools.
Dept of CSE, VVCE 2
Chicago Crime Dataset

Anjana Shashi Kiran, the director, has a total of 17 years of experience. She has been trained
in USA and brings in a wealth of knowledge, experience and expertise in Business English Skills, Soft
Skills, Voice and Accent, Clinical Data Management, Medical Transcription, Aptitude
- Verbal Ability, Campus to Corporate Programs, Training partner and Recruiter for many
institutes. She has worked with Accenture, America Online, Infosys, DELL, Philips and many
more. Academic Institutes: REVA, Vidyavardhaka College of Engineering, ATME, Siddhartha
College of Engineering, Teresian, St. Philomena's and many more.
Srikrishna S Kashyap, the trainer, has been a part of NASA as Galactic Mentor. He also
has worked with Google for the Front-End Web Development. He has undergone training with
ISRO and worked on Profile Orbit Target File Generator. He has worked with dozens of start ups
and other organizations in the "Web Development" field.
Harshith Ramesh has completed his B.E. in Mechanical Engineering. He brings in his
expertise in Project Management. He has been an Entrepreneur with a dynamic personality. He
has many credentials where he has been exemplary in Business English Skills, Team Work, Short
Film Making and Web Development Programs. His areas of expertise are Project Management,
Strategic Planning, Marketing and Training Aptitude Based Programs.
Abiyaantrix is a quality training provider, having a young, talented and experienced team.
They are best described by words confident, competent and caring. They explore opportunities
around and try to grow along with their clients. They are strong believers of having long term
relations with all businesses and people.

Dept of CSE, VVCE 3


Chicago Crime Dataset

Chapter 2

INTRODUCTION
2.1 OBJECTIVE
To analyse large amounts of data within small time frames, organisations prefer working
with the data directly over samples.

2.2 PROBLEM STATEMENT


To analyse the Chicago Crime Data set.

2.3 PROPOSED SOLUTION


The proposed system is writing a code using MEAN stack. MEAN is a free and open-
source JavaScript software stack for building dynamic web sites and web applications. The MEAN
stack is MongoDB, Express.js, AngularJS (or Angular), and Node.js. Because all components of
the MEAN stack support programs are written in JavaScript, MEAN applications can be written
in one language for both server-side and client-side execution environments. The code analyses
large amount of data within small time frames. It saves the time of data scientists.

2.4 ORGANISATION OF THE REPORT


The project report is organized as follows: Chapter 1 includes company profile. Chapter 2
gives the introduction. Chapter 3 provides information about the tasks performed. Finally, chapter
4 depicts the reflection notes.

Dept of CSE, VVCE 4


Chicago Crime Dataset

Chapter 3
TASK PERFORMED

REALISATION OF OBJECTIVE

A dataset is provided to us. This dataset reflects reported incidents of crime (with the
exception of murders where data exists for each victim) that occurred in the City of Chicago from
2001 to present, minus the most recent seven days. Data is extracted from the Chicago Police
Department's CLEAR (Citizen Law Enforcement Analysis and Reporting) system. In order to
protect the privacy of crime victims, addresses are shown at the block level only and specific
locations are not identified. These crimes may be based upon preliminary information supplied to
the Police Department by the reporting parties that have not been verified. The preliminary crime
classifications may be changed at a later date based upon additional investigation and there is
always the possibility of mechanical or human error. Therefore, the Chicago Police Department
does not guarantee (either expressed or implied) the accuracy, completeness, timeliness, or correct
sequencing of the information and the information should not be used for comparison purposes
over time. The dataset contains more than 65,000 records/rows of data and cannot be viewed in
full in Microsoft Excel. Therefore, when downloading the file, select CSV from the Export menu.
Open the file in an ASCII text editor, such as WordPad, to view and search. We analyse the given
large amount of dataset in a smaller time frames so that the organisations can work with data
directly over samples.

Dept of CSE, VVCE 5


Chicago Crime Dataset

Chapter 4

REFLECTION NOTES

4.1 EXPERIENCE AND ASSESSMENTS


I completed my internship in data science and web designing at Abiyaantrix Tech
solutions. During this one-month internship, first, I was introduced to the basics of HTML and
CSS and its syntax. Various HTML tags, CSS styles were used to write simple programs to
demonstrate the use of HTML in web development. Appropriate tasks were assigned to create
simple templates using these features. Then, JavaScript was introduced. Topics of data science
was made to analyse using python programming. Tasks were assigned from time to time and had
to be submitted within the given deadlines. This ensured continuous assessment and thorough
understanding of the subject.

4.2 TECHNICAL OUTCOMES


Upon completion of this project, I was able to:
• Use fundamental skills to maintain web server services required to host a website.
• Select and apply mark-up languages for processing, identifying and presenting of
information in web pages.
• Flush the unwanted data or the null data from a given dataset.
• Analyse huge datasets in small time frames.

4.3 PERSONALITY DEVELOPMENT


This internship helped me develop a personal work ethic and be able to investigate my
career interests, prospective career goals and my approach to a professional workspace. It also
gave me an opportunity to interact with the professionals and learn how to communicate in a
professional environment. It has introduced me to a lot of useful resources and has helped me
acquire references. As an intern I have learnt that time management is vital in every circumstance
whether your attending sessions, finishing tasks on deadlines or meeting your mentor. Last but
not the least, it has improved my chances as a job applicant and helped me become a better
potential employee.

Dept of CSE, VVCE 6


Chicago Crime Dataset

4.4 TIME MANAGEMENT


Working as an intern helped me realize that every minute counts in a corporate world. First
thing I learnt is to make a schedule and stick to it. I started organizing my days and weeks in
advance so that even if there were last minute changes, I could somehow manage to cope up with
it.
The second lesson I learnt is how to prioritize. To efficiently execute any task, I needed to
decide which stages or components of the task are most important and of more impact in the short,
medium and long terms. To set some boundaries that will deliver the best returns.

4.5 SKILLS
• Clear fundamentals of HTML and CSS. To be able to write HTML from scratch.
• To be able to create websites for optimal user experience that are compatible with mobile
devices.
• Analytical skills, that is to have a strong understanding of consumers that will help you
create websites that sell.
• Easily analyse a considerably huge dataset without letting the system crash.
• Construct heatmaps to the data whenever and wherever required for better understanding
of the dataset.
• Not only to have great skills at coding, but prior to that to be great at resolving problems
and to give birth to great ideas.

Dept of CSE, VVCE 7


Chicago Crime Dataset

SNAPSHOTS

Figure 5.1: The initial data set

Figure 5.2: Description of crime with date and time

Dept of CSE, VVCE 8


Chicago Crime Dataset

Figure 5.3: Head count of each type of crime

Figure 5.4: White grid plotting of crimes

Dept of CSE, VVCE 9


Chicago Crime Dataset

Figure 5.5: Graph of Arrests (Yearly and monthly)

Figure 5.6: Graph of Arrests (Weekly and daily)

Dept of CSE, VVCE 10


Chicago Crime Dataset

Figure 5.7: Graph of Domestic violence (Yearly and monthly)

Figure 5.8: Graph of Domestic violence (Weekly and daily)

Dept of CSE, VVCE 11


Chicago Crime Dataset

Figure 5.9: Top 5 monthly crimes 2014 and 2015

Figure 5.10: Top 5 weekly crimes 2014 and 2015

Dept of CSE, VVCE 12


Chicago Crime Dataset

Figure 5.11: Top 5 daily crimes 2014 and 2015

Dept of CSE, VVCE 13


Chicago Crime Dataset

CHAPTER 5

CONCLUSION

An overwhelming expansion of data archives posed a challenge to various industries,


as these are now struggling to make use of such enormous amount of information. Almost 90%
of all data ever recorded worldwide has been created in the last decade alone. In this project
we have explored the data and it provides the insights and forecasts about crimes in Chicago.
It extracts the data from Chicago Police Department’s CLEAR (Citizen Law Enforcement
Analysis and Reporting) system. It contains information on reported incidents of crime in the
city of Chicago from 2001 to present. The proposed model generates a superior concept over
the crime prediction by implementing the novel data mining and machine learning techniques.
Awareness agenda should be put into practice to guarantee that clients recognize data concerns,
intensity of privacy and the method to make the banking transactions secure.

Dept of CSE, VVCE 14


Chicago Crime Dataset

REFERENCES

[1] https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-present/ijzp-q8t2

[2] https://data.cityofchicago.org/Public-Safety/Chicago-Police-Department-Illinois-
Uniform-Crime-R/c7ck-438e

[3] https://portal.chicagopolice.org/portal/page/portal/ClearPath

[4] https://en.wikipedia.org/wiki/Chicago_Crime_Commission

[5] https://elitedatascience.com/data-cleaning

[6] http://mean.io/

Dept of CSE, VVCE 15