
We used to keep private credentials on production servers without any protection or encryption. Well, luckily we don’t have any leak but this practice is not recommended for both security and easy of use reasons.
Since AWS finally provides KMS(Key Management Service) in our local region, we try to encrypt every private credentials by KMS and store them on S3.
TBD
Partitioning refers to splitting what is logically one large table inot smaller physical pieces.
Currently, PostgreSQL supports partitioning via table inheritance. Each partition must be created as a child table of a single parent table. The parent table itself is normally empty; It exists just to represent the entire data set.
There are two forms of partitioning can be implemented in PostgreSQL:
Range Partitioning
The table is partitioning into “range” defined by a key column or a set of columns, with no overlap between the ranges of values assigned to different partitions. eg. partition by date ranges or by identifiers.
List Partitioning
The table is partitioned by explicitly listing which key values appear in each partition.
Create the “master” / “parent” table, from which all the partitions will inherit.
This table will not contain any data. Do not define any check on this table, unless you intend them to be applied equally to all partitions. There is no point in defining any indexes or unique constraints on it either.
Create “child” tables that each inherit form the master table. Normally, these tables will not add any columns to the set inherited from the master.
Add table constraints to the partition tables to define the allowed key values in each partitions.
Ensure that the constraints guarantee that there is no overlap between the key values premitted in different partitions. And there is no difference in syntax between range and list partitioning.
Create indexes on column(s) for each partitions.
Optionally, define a trigger or rule to redirect data inserted into the master table to the appropriate partition.
Ensure hte constraint_exclusion configuration parameter is not disabled in postgresql.conf. If it is, queries will not be optimized as desired.
As we are creating new table and hopping data insered to right partition, a trigger function and a trigger are needed.
I’ll come across many hg branch-switching or log searching tasks during my work. Using bash functions and awk greatly reduce the time I spend on dealing with these tasks. So let’s see what these functions can do to help me improve my productivity.
Bash functions are scripts like alias, but can do much a lot than aliasThe first kind of usages of function is to run several scripts continuously, the same as && I guess:
1 | function run { |
I activate django virtural enviroment first and then run django serve.
Functions, like in any other languages, can take parameters and returns a result. So I can use this ability to do a liitle more complex work:
1 | function ba { hgba | grep "$1" } |
I can just type ba release then get current release branch info.
Append:
I made some updates to the ba function and make it auto copying the branch result to my clipboard:
1 |
|
awk is a command I just know recently. I need to process a bunch of logs and analyze them. What I used to do is download the logs, grep things I want into a txt file and then process the txt file with python.
Tastypie can be thought of as a set of class-based view that provide the API functionality. All routing/middleware/response-handling aspectss are the same as a typical Django app. Where the differs is in the view itself.
Walking through what a GET request to a list endpoint looks like:
The Resource.urls are checked by Django’s url resolvers.
On a match for the list view, Resource.wrap_view('dispatch_list') is called. wrap_view provides basic error handling & allows for returning serilized errors.
Because dispatch_list was passed to wrap_view, Resource.dispatch_list is called next. This is a thin wrapper around Resource.dispatch.
dispatch does a bunch of havy lifting. It ensures:
allowed_methos (method_check).get_list)is_authenticated)throttle_check).At this point, dispatch actually calls the requested method (get_list).
get_list does the actual work of API. It does:
Resource.obj_get_list. In the case of ModelResource, this builds the ORM filters to apply (ModelResource.build_filters). It then gets the QuerySet via ModelResource.get_object_list (which performs Resource.authorized_read_list to possibly limit the set the user can work with) and applies the built filters to it.ModelResource.apply_sorting).Paginator & pulls out the data to be serialized.full_dehydrate applied to each of them, causing Tastypie to traslate the raw object data into the fields the endpoint supports.Resource.create_response.create_response is a shortcut method that:
Resource.determine_format).HttpResponse (200 OK) with the serialized data.We bubble back up the call stack to dispatch. The last thing dispatch does is potentially store that a request occured for future throttling (Resource.log_throttled_access) then either returns the HttpResponse or wraps whatever data came back in a response (so Django doesn’t freak out).
Resources are the heart of Tastypie. By defining a resource we can actually convert a model into an API stream. The data is automatically converted into API response.
Understanding the process of creating a resource.
Add API URL in the urls.py of app.

Dehydration in Tastypie means making alterations before sending data to the client. Suppose we need to send capitalized product names instead of small letters. Now we see two kinds of dehydrate methods.
This dehydrate_field is uesd to modify field on the response JSON.
Dehydrate method is useful for aadding additional fields to bundle (response data).
Similarly using hydrate method we can alter the bundle data which is generated from request at the time of PUT or POST methods.
Django 里会为每一个 model 生成一个 Manager,默认名字为 objects,一般情况下对 model 进行的处理都是通过 model.objects.XXX( ) 来进行的。其实是调用了 model 的 manager 的方法,而 manager 之中的方法是 QuerySet 方法的代理,QuerySet 方法是对数据库操作的封装。
eg.
1 | from django.db import models |
上面这个 model,Person.objects会产生一个AttributeError,但是Person.people就可以正常操作。因为默认的 manager 已经变成 people,objects 这个 manager 没有重新声明,不起作用。
通常需要自定义 manager 的情况有两点:
如果使用自定义的 manager 需要注意的是,Django 将 model 中定义的第一个 manager 认为是默认 manager,而且 Django 框架中会用到默认 manager。
笨方法是使用自定义 manager 的时候,对于 model 依然提供 objects 这个默认 manager,并放在第一个。
eg.
1 | class Book(models.Model): |
To handle web forms we use Flask-WTF . So we need to write a config file (file config.py):
1 | WTF_CSRF_ENABLED = True |
And then you need to use this config (file app/__init__.py):
1 | from flask import Flask |
Let’s build a simple form (file app/forms.app):
1 | from flask.ext.wtf import Form |
The DataRequired() is a validator that checks the field is empty or not.
After that, we need a HTML page to show the form (file app/templates/login.html):
1 | <!-- extend from base layout --> |
The final step is to code a view function that renders the template and receiving data from form (file app/views.py):
1 | from flask import render_template, flash, redirect |