
Kaggle Springleaf Data

Proposition Data

Springleaf Marketing Response


https://www.kaggle.com/c/springleaf-marketing-response/details/timeline

Home
Determine whether to send a direct mail piece to a customer
Springleaf puts the humanity back into lending by offering their customers
personal and auto loans that help them take control of their lives and their
finances. Direct mail is one important way Springleaf's team can connect with
customers who may be in need of a loan.

Direct offers provide huge value to customers who need them, and are a
fundamental part of Springleaf's marketing strategy. In order to improve their
targeted efforts, Springleaf must be sure they are focusing on the customers
who are likely to respond and be good candidates for their services.
Using a large set of anonymized features, Springleaf is asking you to predict
which customers will respond to a direct mail offer. You are challenged to
construct new meta-variables and employ feature-selection methods to
approach this dauntingly wide dataset.
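
As one illustration of a feature-selection step for a very wide table, the sketch below drops columns whose value never changes across rows, since such columns cannot help separate responders from non-responders. It is a minimal example under stated assumptions, not a method prescribed by the competition: the function name `drop_constant_columns` and the representation of rows as dictionaries are hypothetical choices made for this example.

```python
def drop_constant_columns(rows):
    """Remove columns that hold a single distinct value across all rows.

    `rows` is a list of dicts sharing the same keys (an assumed layout).
    Returns (filtered_rows, dropped_column_names).
    """
    if not rows:
        return [], []
    columns = list(rows[0].keys())
    # A column with at most one distinct value carries no signal.
    dropped = [c for c in columns if len({row[c] for row in rows}) <= 1]
    kept = [c for c in columns if c not in dropped]
    filtered = [{c: row[c] for c in kept} for row in rows]
    return filtered, dropped
```

A filter like this is cheap to run before any model-based selection, which is why it is often a first pass on datasets with many columns.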

Data
Data Files
See this example R Script that trains an XGBoost model and creates a
submission.
You are provided a high-dimensional dataset of anonymized customer
information. Each row corresponds to one customer. The response variable
is binary and labeled "target". You must predict the target variable for every
row in the test set.
The features have been anonymized to protect privacy and comprise a
mix of continuous and categorical features. You will encounter many
"placeholder" values in the data, which represent cases such as missing values.
We have intentionally preserved their encoding to match with internal systems
at Springleaf. The meaning of the features, their values, and their types are
provided "as-is" for this competition; handling a huge number of messy
features is part of the challenge here.
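
One possible way to handle the placeholder values is to map them to an explicit missing marker before modeling. The sketch below assumes the placeholder codes for each column are already known. The column names (`feature_a`, `feature_b`) and the codes (`-1`, `9999`, the empty string) are hypothetical, because the competition text does not list the actual encodings. The helper `missing_count` also shows a simple meta-variable: the number of missing fields per customer.

```python
def mask_placeholders(rows, placeholders):
    """Replace assumed placeholder codes with None, column by column.

    `placeholders` maps a column name to the set of values that should be
    treated as missing in that column (hypothetical codes, supplied by you).
    """
    return [
        {
            col: (None if val in placeholders.get(col, ()) else val)
            for col, val in row.items()
        }
        for row in rows
    ]


def missing_count(row):
    """Meta-variable: how many fields of this row are missing."""
    return sum(1 for val in row.values() if val is None)


if __name__ == "__main__":
    # Hypothetical rows and placeholder codes, for illustration only.
    rows = [
        {"feature_a": -1, "feature_b": "red"},
        {"feature_a": 42, "feature_b": ""},
    ]
    codes = {"feature_a": {-1, 9999}, "feature_b": {""}}
    for cleaned in mask_placeholders(rows, codes):
        print(cleaned, missing_count(cleaned))
```

Keeping the mapping per column matters because the same number can be a valid measurement in one column and a placeholder in another.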

Information
Evaluation
Submissions are evaluated on area under the ROC curve between the predicted
probability and the observed target.
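
To make the metric concrete, the sketch below computes the area under the ROC curve with the rank-sum (Mann-Whitney) identity: the AUC equals the probability that a randomly chosen positive row receives a higher score than a randomly chosen negative row. The function name `roc_auc` is chosen for this example, and giving tied scores their average rank is the usual convention rather than something the competition text specifies.

```python
def roc_auc(y_true, y_score):
    """Area under the ROC curve via the rank-sum identity.

    y_true holds 0/1 labels, y_score holds predicted probabilities.
    Tied scores receive their average rank (half credit per tied pair).
    """
    pairs = sorted(zip(y_score, y_true))
    n = len(pairs)
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the run of equal scores starting at position i.
        j = i
        while j + 1 < n and pairs[j + 1][0] == pairs[i][0]:
            j += 1
        average_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1

    n_pos = sum(1 for _, label in pairs if label == 1)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC is undefined when only one class is present")

    rank_sum_pos = sum(r for r, (_, label) in zip(ranks, pairs) if label == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
```

Because only the ordering of the scores enters the calculation, any monotonic rescaling of the predicted probabilities leaves the AUC unchanged.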

Submission File
For each ID in the test set, you should predict a probability. The file should
contain a header and have the following format:
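
The exact layout is not reproduced here. As a sketch only, the code below assumes a two-column CSV whose header is `ID,target`, with one row per test ID and a probability in the second column. Both column names are assumptions inferred from the wording above ("For each ID" and the response variable labeled "target"), and `write_submission` is a hypothetical helper name.

```python
import csv
import io


def write_submission(ids, probabilities, out):
    """Write an assumed `ID,target` CSV to the file-like object `out`."""
    if len(ids) != len(probabilities):
        raise ValueError("ids and probabilities must have the same length")
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["ID", "target"])  # assumed header names
    for row_id, p in zip(ids, probabilities):
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"probability out of range for ID {row_id}: {p}")
        writer.writerow([row_id, f"{p:.6f}"])


if __name__ == "__main__":
    # Hypothetical IDs and predictions, for illustration only.
    buffer = io.StringIO()
    write_submission([2, 5], [0.1, 0.75], buffer)
    print(buffer.getvalue())
```

Writing to a file-like object keeps the helper easy to check in memory; passing an opened file (with `newline=""`) would write the same content to disk.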
