Each row reflects the popularity score (log-views) of the corresponding online post.
Popularityscore ... 3.2 2.3 ...
Each row is the path of the corresponding photo or video
... "train/59@N75/775.jpg" "train/1@N18/1075.jpg" "train/351@N64/4890.jpg" ...
Each row corresponds a category set of the online post.
{
"Uid": "70478@N10",
"Pid": 564687,
"Category": "Whether&Season",
"Subcategory": "Raining",
"Concept": "umbrella"
}
Uid: the user this post belongs to.
Pid: the photo along with the post. One Pid can locate a particular post.
Category: the first category of the post.(11 classes)
Subcategory: there are 77 classes in 2nd level category.
Concept: there are 668 different description.
Each row represents a text information of the post.
{
"Uid": "70478@N10",
"Pid": "564687",
"Tile": "Sarah Moon 3",
"Mediatype": "photo",
"Alltags": "black" "hat" "fashion" "yellow" "umbrella"
}
Title: the tile of the post defined by the user.
Mediatype: the type of the attached media file, including 'photo' and 'video'.
Alltags: the customized tags from users.
Each row offers the date and geographic information of the post.
{
"Uid": "25893@N22",
"Pid": "565381",
"Latitude": "52.520213",
"Longitude": "13.373097",
"Geoaccuracy": "16"
}
Postdate: the publish timestamp of the post. It can be converted to Datetime by following python code:
import time
timestamp = 1457068974
timeArray = time.localtime(timestamp)
datetime = time.strftime("%Y-%m-%d %H:%M:%S", timeArray)
Latitude: the latitude whose valid range is -90 to 90. Anything more than 6 decimal places will be truncated.
Longitude: the longitude whose valid range is -180 to 180. Anything more than 6 decimal places will be truncated.
Geoaccuracy: recorded accuracy level of the location information. World level is 1, Country is ~3, Region ~6, City ~11, Street ~16. The current range is 1-16. Defaults to 16 if not specified.
Each row contains the user data of the post.
{
"photo_firstdate": "1213743830",
"photo_count": "6828",
"ispro": "1",
"canbuypro": "0",
"timezone_offset": "1"
"photo_firstdatetaken": "1904010100"
"timezone_id":"9"
"user_description":"0.0866962,-0.0752717,..."
"location_description":"0,0,..."
}
Photo_firstdate: the date of the first photo uploaded by the user.
Photo_count: the number of posted photo by the user.
Ispro: is the user belong to pro member.
Photo_firstdatetaken: the date of the first photo taken by the user.
Timezone_offset: the time zone of the user.
User description: the feature used to describe the user data.
Location description: the feature used to describe the user location.
Each row offers the supplimental information of the post.
{
"Uid": "25893@N22",
"Pid": "565381",
"Pathalias": "hoo_nose_68",
"Ispublic": "1",
"Mediastatus": "ready"
}
Pathalias: the path alias provided by the user.
Ispublic: indicates that the post is authenticated with 'read' permissions.
Mediastatus: indicates that the attached media is ready to access by others.