Each row reflects the popularity score (log-views) of the corresponding online post.
Popularityscore ... 3.2 2.3 ...
Each row is the path of the corresponding photo or video.
... "train/59@N75/775.jpg" "train/1@N18/1075.jpg" "train/351@N64/4890.jpg" ...
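The sample paths above can be split into components with the standard library. This is only a sketch: it assumes the layout is <split>/<Uid>/<Pid>.<ext>, which the sample paths suggest but this document does not state, and split_media_path is a hypothetical helper name.

```python
from pathlib import PurePosixPath


def split_media_path(path: str) -> dict:
    """Split a media path into its components.

    Assumption (not stated in the dataset description): the layout is
    <split>/<Uid>/<Pid>.<ext>.
    """
    parts = PurePosixPath(path).parts
    if len(parts) != 3:
        raise ValueError(f"unexpected path layout: {path!r}")
    split, uid, filename = parts
    name = PurePosixPath(filename)
    return {"split": split, "uid": uid, "pid": name.stem, "ext": name.suffix}


info = split_media_path("train/59@N75/775.jpg")
print(info)
```

Raising on an unexpected number of components keeps a wrong layout assumption from passing silently.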
Each row corresponds to the category set of an online post.
{ "Uid": "70478@N10", "Pid": 564687, "Category": "Whether&Season", "Subcategory": "Raining", "Concept": "umbrella" }
Uid: the user this post belongs to.
Pid: the ID of the photo attached to the post. A Pid uniquely identifies a post.
Category: the first-level category of the post (11 classes).
Subcategory: the second-level category of the post (77 classes).
Concept: a finer-grained description of the post (668 distinct concepts).
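Because Uid and Pid appear in several of the files, they are a natural key for combining rows. In the samples in this document, Pid is a number in the category row and a string in the other rows, so the sketch below normalizes it to a string before matching. post_key is a hypothetical helper, and the abbreviated text row is for illustration only.

```python
import json


def post_key(record: dict) -> tuple:
    """Return a (Uid, Pid) key with Pid normalized to a string."""
    return (str(record["Uid"]), str(record["Pid"]))


category_row = json.loads(
    '{"Uid": "70478@N10", "Pid": 564687, "Category": "Whether&Season", '
    '"Subcategory": "Raining", "Concept": "umbrella"}'
)
# Abbreviated row from another file, where Pid is stored as a string
text_row = {"Uid": "70478@N10", "Pid": "564687", "Mediatype": "photo"}

# Index the category rows by key, then look up the match for the other row
categories = {post_key(category_row): category_row}
matched = categories.get(post_key(text_row))
print(matched["Category"], matched["Subcategory"], matched["Concept"])
```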
Each row contains the text information of a post.
{ "Uid": "70478@N10", "Pid": "564687", "Title": "Sarah Moon 3", "Mediatype": "photo", "Alltags": "black" "hat" "fashion" "yellow" "umbrella" }
Title: the title of the post defined by the user.
Mediatype: the type of the attached media file, including 'photo' and 'video'.
Alltags: the custom tags assigned by the user.
Each row provides the date and geographic information of the post.
{ "Uid": "25893@N22", "Pid": "565381", "Latitude": "52.520213", "Longitude": "13.373097", "Geoaccuracy": "16" }
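Postdate: the publication timestamp of the post, in Unix time (seconds). It can be converted to a date-time string with the following Python code:
    import time

    timestamp = 1457068974
    # localtime() interprets the timestamp in the machine's local time zone
    time_array = time.localtime(timestamp)
    date_time = time.strftime("%Y-%m-%d %H:%M:%S", time_array)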
Latitude: the latitude whose valid range is -90 to 90. Anything more than 6 decimal places will be truncated.
Longitude: the longitude whose valid range is -180 to 180. Anything more than 6 decimal places will be truncated.
Geoaccuracy: recorded accuracy level of the location information. World level is 1, Country is ~3, Region ~6, City ~11, Street ~16. The current range is 1-16. Defaults to 16 if not specified.
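The field rules above can be checked when loading a row. The sketch below is one possible approach, not part of the dataset tooling: parse_geo and postdate_to_utc are hypothetical helpers, the mapping of in-between Geoaccuracy values to the nearest coarser named level is an assumption, and the UTC conversion is offered as a machine-independent alternative to a local-time conversion.

```python
from datetime import datetime, timezone

# Named levels taken from the Geoaccuracy description; values in between
# are mapped to the nearest coarser level (an assumption of this sketch).
GEO_LEVELS = {1: "World", 3: "Country", 6: "Region", 11: "City", 16: "Street"}


def parse_geo(record: dict) -> dict:
    """Convert the string fields of a geo row and check their ranges."""
    lat = float(record["Latitude"])
    lon = float(record["Longitude"])
    acc = int(record.get("Geoaccuracy", 16))  # defaults to 16 if not specified
    if not -90 <= lat <= 90:
        raise ValueError(f"latitude out of range: {lat}")
    if not -180 <= lon <= 180:
        raise ValueError(f"longitude out of range: {lon}")
    if not 1 <= acc <= 16:
        raise ValueError(f"geoaccuracy out of range: {acc}")
    level = GEO_LEVELS[max(k for k in GEO_LEVELS if k <= acc)]
    return {"latitude": lat, "longitude": lon, "geoaccuracy": acc, "level": level}


def postdate_to_utc(timestamp: int) -> str:
    """Format a Unix timestamp as a UTC date-time string."""
    moment = datetime.fromtimestamp(int(timestamp), tz=timezone.utc)
    return moment.strftime("%Y-%m-%d %H:%M:%S")


geo = parse_geo({"Latitude": "52.520213", "Longitude": "13.373097", "Geoaccuracy": "16"})
utc_str = postdate_to_utc(1457068974)
print(geo["level"], utc_str)
```

Formatting in UTC gives the same string on every machine, whereas a local-time conversion depends on the time zone of the machine running it.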
Each row contains the user data of the post.
{ "photo_firstdate": "1213743830", "photo_count": "6828", "ispro": "1", "canbuypro": "0", "timezone_offset": "1", "photo_firstdatetaken": "1904010100", "timezone_id": "9", "user_description": "0.0866962,-0.0752717,...", "location_description": "0,0,..." }
Photo_firstdate: the date of the first photo uploaded by the user.
Photo_count: the number of photos posted by the user.
Ispro: whether the user is a pro member.
Photo_firstdatetaken: the date of the first photo taken by the user.
Timezone_offset: the time zone offset of the user.
User_description: the feature vector describing the user data.
Location_description: the feature vector describing the user location.
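In the sample row, every value is a string, and the two description fields hold comma-separated numbers. A minimal sketch of converting them to native types follows. The helper names are hypothetical, the two-element vectors stand in for the elided values of the sample, and reading "1" as true for ispro is an assumption.

```python
from typing import List


def parse_feature_vector(text: str) -> List[float]:
    """Parse a comma-separated string of numbers into a list of floats."""
    return [float(item) for item in text.split(",") if item.strip()]


def parse_user(record: dict) -> dict:
    """Convert selected string fields of a user row to native types."""
    return {
        "photo_count": int(record["photo_count"]),
        "ispro": record["ispro"] == "1",  # assumption: "1" means pro member
        "user_description": parse_feature_vector(record["user_description"]),
        "location_description": parse_feature_vector(record["location_description"]),
    }


# Hypothetical shortened row; real rows carry longer vectors
user = parse_user({
    "photo_count": "6828",
    "ispro": "1",
    "user_description": "0.0866962,-0.0752717",
    "location_description": "0,0",
})
print(user)
```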
Each row provides the supplemental information of the post.
{ "Uid": "25893@N22", "Pid": "565381", "Pathalias": "hoo_nose_68", "Ispublic": "1", "Mediastatus": "ready" }
Pathalias: the path alias provided by the user.
Ispublic: indicates that the post is authenticated with 'read' permissions.
Mediastatus: indicates that the attached media is ready to be accessed by others.