Airbnb Data Dictionary

Explore our comprehensive Airbnb dataset with detailed short-term rental statistics, pricing analytics, and market insights for investors, researchers, and property managers.

Listings Data

Comprehensive details of all Airbnb listings providing essential insights into property distribution, amenities, pricing strategies, and competitive positioning across markets.

Common Use Cases

  • Market analysis and competitive positioning
  • Property type and amenity distribution analysis
  • Pricing strategy development based on property attributes
  • Geographical analysis of listing density and characteristics

Sample Visualizations

  • Heat maps showing property density by neighborhood
  • Price distribution charts by property type
  • Amenity correlation matrices
  • Property type distribution pie charts

Schema

38 fields
Field Name              Description
listing_id              Unique identifier for the listing
host_id                 Unique identifier for the host
host_name               Name of the host
listing_name            Title of the listing
latitude                Geographical latitude coordinate
longitude               Geographical longitude coordinate
guests                  Maximum number of guests allowed
bedrooms                Number of bedrooms available
listing_type            Type of property (e.g., apartment, house, villa)
room_type               Type of room (e.g., entire home, private room)
beds                    Number of beds available
baths                   Number of bathrooms available
min_nights              Minimum number of nights required to book
num_reviews             Total number of reviews received
star_rating             Overall star rating of the property
cover_photo_url         URL of the main listing photo
photos_count            Number of photos available for the listing
superhost               Whether the host is a superhost
cohost                  Whether the property has a co-host
registration            Regulatory registration or license number required for operating the listing
amenities               List of amenities offered
cancellation_policy     Type of cancellation policy offered
tier                    Quality classification (Basic, Plus, or Luxury tier) of the listing
rating_overall          Overall rating score
rating_accuracy         Rating score for listing accuracy
rating_communication    Rating score for host communication
rating_cleanliness      Rating score for cleanliness
rating_location         Rating score for location
rating_checkin          Rating score for check-in experience
rating_value            Rating score for value
currency                Currency used for pricing
ttm_vacant_days         Number of vacant days in trailing twelve months
ttm_booked_days         Number of booked days in trailing twelve months
ttm_occupancy           Occupancy rate in trailing twelve months
ttm_revenue             Total revenue in trailing twelve months
ttm_native_revenue      Total revenue in native currency in trailing twelve months
ttm_avg_rate            Average daily rate in trailing twelve months
ttm_avg_native_rate     Average daily rate in native currency in trailing twelve months
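
As an illustration of the pricing use case above, the following sketch computes the median ttm_avg_rate per listing_type. The sample rows are made up, and two details are assumptions rather than documented facts: that ttm_occupancy is stored as a fraction between 0 and 1, and that missing values arrive as None. The helper name median_rate_by_type is hypothetical.

```python
from collections import defaultdict
from statistics import median

# Made-up sample rows using a subset of the Listings schema fields.
# Assumption: ttm_occupancy is a fraction in [0, 1]; missing values are None.
listings = [
    {"listing_id": "a1", "listing_type": "apartment", "ttm_avg_rate": 120.0, "ttm_occupancy": 0.62},
    {"listing_id": "a2", "listing_type": "apartment", "ttm_avg_rate": 100.0, "ttm_occupancy": 0.40},
    {"listing_id": "a3", "listing_type": "apartment", "ttm_avg_rate": 140.0, "ttm_occupancy": 0.71},
    {"listing_id": "h1", "listing_type": "house", "ttm_avg_rate": 200.0, "ttm_occupancy": 0.55},
    {"listing_id": "h2", "listing_type": "house", "ttm_avg_rate": 260.0, "ttm_occupancy": 0.48},
    {"listing_id": "v1", "listing_type": "villa", "ttm_avg_rate": None, "ttm_occupancy": 0.30},
]


def median_rate_by_type(rows, min_occupancy=0.0):
    """Median ttm_avg_rate per listing_type.

    Rows with a missing rate or occupancy, or with occupancy below
    min_occupancy, are skipped.
    """
    rates = defaultdict(list)
    for row in rows:
        rate = row.get("ttm_avg_rate")
        occ = row.get("ttm_occupancy")
        if rate is None or occ is None or occ < min_occupancy:
            continue
        rates[row["listing_type"]].append(rate)
    return {kind: median(values) for kind, values in rates.items()}


all_listings = median_rate_by_type(listings)
active_listings = median_rate_by_type(listings, min_occupancy=0.5)
```

Filtering on a minimum occupancy is one way to keep rarely booked listings from skewing the rate comparison. The 0.5 threshold here is arbitrary.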

Calendar Rates

Availability and pricing information crucial for understanding occupancy patterns, pricing strategies, seasonal variations, and special event impacts.

Common Use Cases

  • Seasonal pricing pattern analysis
  • Occupancy rate calculations and forecasting
  • Special event pricing impact studies
  • Dynamic pricing strategy development

Sample Visualizations

  • Occupancy rate calendars by market
  • Price fluctuation charts throughout the year
  • Special event pricing premium analysis
  • Booking window visualization by season

Schema

13 fields
Field Name              Description
listing_id              Unique identifier for the listing
date                    First day of the month for aggregated monthly data
vacant_days             Number of days the property was vacant
booked_days             Number of days the property was booked
occupancy               Occupancy rate
revenue                 Total revenue generated during the month
rate_avg                Average daily rate
booked_rate_avg         Average rate when booked
booking_lead_time_avg   Average booking lead time in days
min_nights_avg          Average minimum nights requirement
native_booked_rate_avg  Average rate when booked in native currency
native_rate_avg         Average daily rate in native currency
native_revenue          Revenue generated in native currency
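
The schema does not state how occupancy or booked_rate_avg are derived. The sketch below recomputes both from the raw monthly fields under two common definitions, which are assumptions here: occupancy as booked_days / (booked_days + vacant_days), and the booked rate as revenue / booked_days. It also checks that the day counts fit within the calendar month given by date. The sample rows, the ISO date format, and the function name summarize_month are illustrative only.

```python
import calendar
from datetime import date

# Made-up monthly rows using a subset of the Calendar Rates fields.
# Assumption: date is an ISO string for the first day of the month.
calendar_rows = [
    {"listing_id": "a1", "date": "2024-06-01", "booked_days": 18, "vacant_days": 12, "revenue": 2700.0},
    {"listing_id": "a1", "date": "2024-07-01", "booked_days": 0, "vacant_days": 31, "revenue": 0.0},
]


def summarize_month(row):
    """Recompute occupancy and booked rate for one monthly row.

    Assumed definitions (not confirmed by the data dictionary):
      occupancy   = booked_days / (booked_days + vacant_days)
      booked_rate = revenue / booked_days
    """
    month_start = date.fromisoformat(row["date"])
    days_in_month = calendar.monthrange(month_start.year, month_start.month)[1]
    tracked_days = row["booked_days"] + row["vacant_days"]
    return {
        "listing_id": row["listing_id"],
        "month": month_start,
        # None when no days were tracked, to avoid dividing by zero.
        "occupancy": row["booked_days"] / tracked_days if tracked_days else None,
        # None when nothing was booked, since the rate is undefined.
        "booked_rate": row["revenue"] / row["booked_days"] if row["booked_days"] else None,
        "days_consistent": tracked_days <= days_in_month,
    }


summaries = [summarize_month(row) for row in calendar_rows]
```

Comparing these recomputed values against the delivered occupancy and booked_rate_avg columns is a quick way to learn which definitions the dataset actually uses.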

Reviews Data

Guest reviews and ratings with sentiment analysis providing invaluable insights into guest satisfaction, property performance, and host-guest interactions.

Common Use Cases

  • Guest satisfaction analysis by property type or location
  • Sentiment trend analysis over time
  • Common complaint and praise identification
  • Correlation between amenities and positive reviews

Sample Visualizations

  • Sentiment score heat maps by neighborhood
  • Word clouds of most common positive/negative terms
  • Rating trends over time by property category
  • Review volume seasonality charts

Schema

4 fields
Field Name              Description
listing_id              Unique identifier for the listing
date                    First day of the month when reviews were aggregated
num_reviews             Number of reviews for the listing
reviewers               List of reviewer IDs
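
For the review volume seasonality chart mentioned above, a minimal sketch could total num_reviews by month of year across all listings. It assumes that num_reviews counts reviews received in that month (not a running total), that date is an ISO string, and that reviewers is a plain list of IDs. None of these are confirmed by the schema, and the sample rows and function names are made up.

```python
from collections import Counter
from datetime import date

# Made-up monthly rows following the Reviews schema.
# Assumption: num_reviews is the count for that month only.
reviews = [
    {"listing_id": "a1", "date": "2023-07-01", "num_reviews": 4, "reviewers": ["r1", "r2", "r3", "r4"]},
    {"listing_id": "a1", "date": "2024-07-01", "num_reviews": 2, "reviewers": ["r2", "r5"]},
    {"listing_id": "b2", "date": "2024-07-01", "num_reviews": 3, "reviewers": ["r6", "r7", "r8"]},
    {"listing_id": "b2", "date": "2024-01-01", "num_reviews": 1, "reviewers": ["r2"]},
]


def reviews_by_month_of_year(rows):
    """Total review count per calendar month (1-12), pooled across years."""
    totals = Counter()
    for row in rows:
        month = date.fromisoformat(row["date"]).month
        totals[month] += row["num_reviews"]
    return dict(sorted(totals.items()))


def repeat_reviewers(rows):
    """Reviewer IDs that appear in more than one monthly row."""
    counts = Counter(rid for row in rows for rid in row["reviewers"])
    return sorted(rid for rid, n in counts.items() if n > 1)


seasonality = reviews_by_month_of_year(reviews)
repeats = repeat_reviewers(reviews)
```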

Host Data (Coming Soon)

Detailed host information revealing behaviors, performance metrics, and profile characteristics to understand host professionalism, experience levels, and management practices.

Common Use Cases

  • Professional vs. amateur host analysis
  • Superhost performance metrics and characteristics
  • Multi-property host portfolio analysis
  • Host listing growth patterns over time

Sample Visualizations

  • Distribution of hosts by property count
  • Superhost percentage by neighborhood
  • Host performance comparison by experience level
  • Host experience timeline analysis

Schema

11 fields
Field Name              Description
host_id                 Unique identifier for the host
host_name               Name of the host
is_host                 Whether the user is a host
is_superhost            Whether the host is a superhost
ratings                 Host rating score
reviews_count           Number of reviews received by the host
listing_count           Number of properties managed by host
member_since            Date the host joined the platform
languages               Languages spoken by the host
profile_picture         URL to host profile image
about                   Host self-description and biography
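
Since this dataset is not yet available, the following is only a sketch of how the "distribution of hosts by property count" view might be computed once it is. The bucket boundaries (1, 2-5, 6+) and the function names are hypothetical choices, and the sample rows are made up.

```python
from collections import Counter

# Made-up rows using a subset of the planned Host schema fields.
hosts = [
    {"host_id": "h1", "is_superhost": True, "listing_count": 1},
    {"host_id": "h2", "is_superhost": False, "listing_count": 3},
    {"host_id": "h3", "is_superhost": True, "listing_count": 12},
    {"host_id": "h4", "is_superhost": False, "listing_count": 1},
]


def portfolio_bucket(listing_count):
    """Map a listing count to a portfolio-size label (boundaries are arbitrary)."""
    if listing_count <= 1:
        return "single"
    if listing_count <= 5:
        return "small (2-5)"
    return "large (6+)"


def hosts_by_bucket(rows):
    """Number of hosts in each portfolio-size bucket."""
    return dict(Counter(portfolio_bucket(row["listing_count"]) for row in rows))


distribution = hosts_by_bucket(hosts)
```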

Data Quality Commitment

We are committed to providing the highest quality data for your research and business needs. Our rigorous data collection and processing methodology ensures:

Comprehensive Coverage

Our data collection process captures over 95% of all active listings in each market, so your analysis rests on a near-complete picture.

Regular Updates

All datasets are updated monthly, with timestamps indicating the exact collection date for transparency.

Data Cleaning

Our automated and manual cleaning processes remove duplicates, correct errors, and standardize formats for consistency.
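
If you want to spot-check a delivery on your side, a duplicate-key check is a simple starting point. The sketch below is a consumer-side sanity check, not a description of our cleaning pipeline. It assumes that each Calendar Rates row is uniquely identified by (listing_id, date), which the schema implies but does not state. The function name and sample rows are illustrative.

```python
def find_duplicate_keys(rows, key_fields):
    """Return the key tuples that occur more than once, in order of detection."""
    seen = set()
    duplicates = []
    for row in rows:
        key = tuple(row[field] for field in key_fields)
        if key in seen:
            duplicates.append(key)
        else:
            seen.add(key)
    return duplicates


# Made-up Calendar Rates rows; the third repeats the first row's key.
calendar_rows = [
    {"listing_id": "a1", "date": "2024-06-01", "booked_days": 18},
    {"listing_id": "a1", "date": "2024-07-01", "booked_days": 22},
    {"listing_id": "a1", "date": "2024-06-01", "booked_days": 18},
]

duplicates = find_duplicate_keys(calendar_rows, ("listing_id", "date"))
```

The same helper works for the Listings data with key_fields set to ("listing_id",).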