Repository for Petra's work at ampli Jan-Feb 2019
Petra Lamborn bcbb6fe5b9 Clustering file 5 年之前
R Assign a new sample to clusters; look at how well it is modeled 5 年之前
img Clustering file 5 年之前
py Clustering file 5 年之前
sql Start writing readme 5 年之前
.gitignore A harmonic model for annual patterns in rolling 24-hour minimum temperature 5 年之前
README.md Further readme 5 年之前
notes.md Add and reorganise notes 5 年之前
requirements.txt Update requirements 5 年之前

README.md

ampli

Repository for Petra’s work at ampli Jan-Feb 2019

What isn’t in this repository

This repository does not contain with it downloaded data (for confidentiality and size reasons), the fitted models (similar issues), or the configuration file for downloading from the database (which contains a password). Recreate an empty data/ directory and model/. The configuration file is at py/database.ini and looks like:

[postgresql]
host=<Hname>
database=<dbname>
user=<Username>
password=<Password>

This file is based on an example from the postgresSQL website. Replace <Username> and <Password> with valid credentials.

SQL

I’ve included a bunch of annotated SQL queries in sql/queries.pgsql and sql/weather.pgsql. The latter file is specifically about the weather dataset; the former is more general. Some of the queries are copied into the py/util.py file, but changing these two files will do nothing on their own.

Python

requirements.txt contains the python packages required to set up a virtual environment with virtualenv -p /usr/bin/python3 venv and pip install -r requirements.txt. Notably these are:

  • numpy
  • pandas
  • pkg-resources
  • psycopg2-binary
  • pyparsing
  • python-dateutil
  • pytz
  • scipy
  • seaborn
  • statsmodels