Citrics

You can find the project Here. The API docs for the DS app can be found Here.

High level overview presentation Here.

Deep dive into cleaning the data Here.

Datasets used Here

Contributors

V2

Eric Ramon

Jimmy Smith

V1

Scott Maxwell

Matthew Sessions

Luke Townsend

Project Overview

Trello Board

Product Canvas

Citrics provides statistics on 28,924 locations in the United States, including information about housing prices, employment, lifestyle and much more. It was built by a team of web developers and data engineers.

Deployed Front End

Tech Stack

Python
Flask
Docker
Jupyter Notebooks
MongoDB
AWS Elastic Beanstalk
AWS PostgreSQL

Predictions

The following models use a K-Nearest Neighbors algorithm with a KD-tree index from the scikit-learn Python library.

Housing Model:

Features & Metrics Used:

Median Rent
Occupants per room
Housing by bedrooms
Vacancy Rate
Rent Pricing
Historical Property Value
Historical Property Value Growth by %

Industry Model:

Features & Metrics Used:

Industry Types
Health insurance
Salary
Commute & travel time
Retirement
Unemployment

Culture Model:

Features & Metrics Used:

Education
Language
Ethnicity
Birth Rate
Population
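As a rough illustration of how a K-Nearest Neighbors model with a KD-tree can suggest similar cities, here is a minimal sketch using scikit-learn's `NearestNeighbors`. The city names, feature values, the scaling step, and the `similar_cities` helper are all assumptions made for this example; they are not taken from the project's model scripts.

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors
from sklearn.preprocessing import StandardScaler

# Hypothetical toy data: one row per city, with made-up values for
# median rent (USD), vacancy rate (%), and occupants per room.
cities = ["City A", "City B", "City C", "City D", "City E"]
features = np.array([
    [1200.0, 5.0, 0.50],
    [1250.0, 5.5, 0.52],
    [3100.0, 2.0, 0.70],
    [900.0, 9.0, 0.45],
    [3000.0, 2.5, 0.68],
])

# Assumption: features are standardized so that large-valued columns
# such as rent do not dominate the distance calculation.
scaled = StandardScaler().fit_transform(features)

# Build the nearest-neighbor index with a KD-tree.
model = NearestNeighbors(n_neighbors=3, algorithm="kd_tree").fit(scaled)


def similar_cities(index, k=2):
    """Return the names of the k cities closest to the city at `index`."""
    # Request k + 1 neighbors because the query city is returned as well.
    _, neighbor_idx = model.kneighbors(scaled[[index]], n_neighbors=k + 1)
    return [cities[i] for i in neighbor_idx[0] if i != index][:k]


print(similar_cities(0))
```

The same pattern would apply to each of the three models, with only the feature columns changing.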

Note: AWS Elastic Beanstalk has a hard time running NumPy and SciPy, the libraries that power scikit-learn. The joblib library also had a hard time loading models that were trained on a different operating system. Once we found models that worked, we exported the code to a Python script and ran it on a Linux-based machine running Python 3.6. We then used Docker to containerize and ship our Flask app. These steps allowed us to deploy the predictive models seamlessly.
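One way to guard against the cross-environment loading problem described in the note is to store the training environment next to the model and check it at load time. The sketch below uses `joblib.dump` and `joblib.load`; the bundle layout and the function names `train_and_export` and `load_checked` are hypothetical and are not the project's actual export script.

```python
import os
import platform
import sys
import tempfile

import joblib
import numpy as np
from sklearn.neighbors import NearestNeighbors


def current_environment():
    """Describe the interpreter and operating system running this code."""
    return {
        "python": "%d.%d" % sys.version_info[:2],
        "platform": platform.system(),
    }


def train_and_export(features, path):
    """Fit a KD-tree nearest-neighbor model and save it with its environment."""
    model = NearestNeighbors(n_neighbors=2, algorithm="kd_tree").fit(features)
    joblib.dump({"model": model, "env": current_environment()}, path)


def load_checked(path):
    """Load the model, refusing to use it if the environment differs."""
    bundle = joblib.load(path)
    if bundle["env"] != current_environment():
        raise RuntimeError(
            "Model was trained on %s but is being loaded on %s"
            % (bundle["env"], current_environment())
        )
    return bundle["model"]


# Hypothetical data: 20 locations with 3 numeric features each.
rng = np.random.default_rng(0)
features = rng.random((20, 3))

with tempfile.TemporaryDirectory() as tmp_dir:
    model_path = os.path.join(tmp_dir, "housing_model.joblib")
    train_and_export(features, model_path)
    loaded = load_checked(model_path)

# The reloaded model answers the same query as a freshly fitted one.
_, neighbor_idx = loaded.kneighbors(features[:1])
print(neighbor_idx)
```

Running the training script inside the same Docker image that serves the Flask app is one way to keep the two environments identical.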

