Repository for Petra's work at ampli Jan-Feb 2019
Petra Lamborn d4e459128d Bunch of proper python scripts 5 vuotta sitten
R Assign a new sample to clusters; look at how well it is modeled 5 vuotta sitten
img Clustering file 5 vuotta sitten
py Bunch of proper python scripts 5 vuotta sitten
sql Start writing readme 5 vuotta sitten
.gitignore A harmonic model for annual patterns in rolling 24-hour minimum temperature 5 vuotta sitten
README.md Further readme 5 vuotta sitten
notes.md Add and reorganise notes 5 vuotta sitten
requirements.txt Update requirements 5 vuotta sitten

README.md

ampli

Repository for Petra’s work at ampli Jan-Feb 2019

What isn’t in this repository

This repository does not contain with it downloaded data (for confidentiality and size reasons), the fitted models (similar issues), or the configuration file for downloading from the database (which contains a password). Recreate an empty data/ directory and model/. The configuration file is at py/database.ini and looks like:

[postgresql]
host=<Hname>
database=<dbname>
user=<Username>
password=<Password>

This file is based on an example from the postgresSQL website. Replace <Username> and <Password> with valid credentials.

SQL

I’ve included a bunch of annotated SQL queries in sql/queries.pgsql and sql/weather.pgsql. The latter file is specifically about the weather dataset; the former is more general. Some of the queries are copied into the py/util.py file, but changing these two files will do nothing on their own.

Python

requirements.txt contains the python packages required to set up a virtual environment with virtualenv -p /usr/bin/python3 venv and pip install -r requirements.txt. Notably these are:

  • numpy
  • pandas
  • pkg-resources
  • psycopg2-binary
  • pyparsing
  • python-dateutil
  • pytz
  • scipy
  • seaborn
  • statsmodels