Recommendations Using Redis
Recommendations Using Redis
CONTENTS
      Recommendation systems need not always involve complex                 WHAT IS A RECOMMENDATION ENGINE?
      machine learning techniques. With enough data in hand, one
                                                                            A recommendation engine is an application or microservice that
      can develop a recommendation system with little effort. One
                                                                            presents users with the choices they are most likely to make next.
      of the simplest recommendation systems is just a lookup table
                                                                            Recommendations could include the next song a user is likely to
      based on user interests. When you have data for many users
                                                                            want to hear, the next movie that they might watch, or another
      and their behaviors, collaborative filtering is an easy solution
                                                                            action that they may choose to take after completing a reservation.
      to make recommendations. For example, for an e-commerce
      site with collaborative filtering, you can determine which users
                                                                            At a system level, recommendation engines match users with
      that purchased a sleeping bag have also purchased a flashlight,
                                                                            items in which they are most likely to be interested. By pushing
      lantern, and bug repellent. Content-based recommendation
                                                                            relevant, personalized recommendations to users, applications
      systems go a step further and incorporate greater sophistication
                                                                            can encourage users to purchase relevant items, increase their
      in predicting what a user is likely to want, based on that user’s
                                                                            time spent on a site or in the app, or click on the right ads –
      interactions. This article demonstrates how to develop simple
                                                                            ultimately helping maximize revenues, usage, or eyeballs.
      recommendation systems in Redis based on user-indicated
      interests and collaborative filtering.
                                                                            Effective recommendation engines need to meet the following
                                                                            criteria:
       WHAT IS REDIS?
      Redis is an in-memory, NoSQL data store, frequently used                1.	 Generate the right and relevant choices for their users (this
      as a database, cache, and message broker. Unlike other in-                  usually depends on the algorithm chosen).
      memory data stores, it can persist your data to a disk, and can
                                                                              2.	 Provide high performance, with choices presented to users
      accommodate a wide variety of data structures (Sets, Sorted
                                                                                  in real-time.
      Sets, Hashes, Lists, Strings, Bit Arrays, HyperLogLogs, and
      Geospatial Indexes). Redis commands enable developers to                3.	 Be efficient with system resources, as with any well-written
      perform highly performant operations on these structures with               application.
      very little complexity. In other words, Redis is purpose-built for
      performance and simplicity.                                           At a system level, recommendation engines match users with
                                                                            items in which they are most likely to be interested. By pushing
       DATA STRUCTURES IN REDIS
                                                                            Continued on next page
      Data structures are like Lego building blocks; helping developers
      achieve specific functionality with the least amount of complexity,
      least overhead on the network, and lowest latency because
      operations are executed extremely efficiently in memory, right
      next to where the data is stored.
      Data structures set Redis apart from other key/value stores and
      NoSQL databases in terms of versatility and flexibility. Redis data
      structures include:
       •	 Strings
       •	 Hashes
       •	 Lists
       •	 Sets
       •	 Sorted sets
       •	 Bit Arrays
       •	 HyperLogLogs
       •	 Geospatial Indexes
      COMMON TYPES OF RECOMMENDATION ENGINES                                                    SINTER key [key …]        Intersect multiple sets.
     The most common recommendation engines are based on user-                                  SINTERSTORE               Intersect multiple sets
     chosen profile settings, collaborative filtering, and content-based                        destination key           and store the resulting
     recommendations.                                                                           [key …]                   set in a key.
                                                                                                                          Determine if a given
     Recommendations based on user-chosen profile settings are the                              SISMEMBER key
                                                                                                                          value is a member of a
                                                                                                member
     simplest to implement. However, they are static; they don’t take                                                     set.
     user behavior into account or try to understand what’s being                                                         Get all the members of
     recommended.                                                                               SMEMBERS key
                                                                                                                          a set.
     Collaborative filtering works well when you have many users and                            SREM key member           Remove one or more
     have collected enough information to understand and categorize                             [member …]                members from a set.
     your users based on their behavior. Collaborative filtering is quite
                                                                                                SUNION key [key …]        Add multiple sets.
     effective and can generate surprisingly interesting results. On the
     flip side, they can be computationally heavy.                                              SUNIONSTORE               Add multiple sets and
                                                                                                destination key           store the resulting set in
     Content-based recommendations rely on Machine Learning                                     [key …]                   a key.
     and an understanding of the dimensions of user attributes
                                                                                                                          Incrementally iterate Set
     and the attributes of the items that are being recommended.                                SSCAN key cursor
                                                                                                                          elements.
     Preparing the right data model is often a tricky and avery lengthy
                                                                                                                          Add one or more
     process. However, with the right data model, content-based                                 ZADD key score
                                                                                                                          members to a sorted set,
     recommendations can produce great results with little historical             Sorted Sets   member [score
                                                                                                                          or update its score if it
     data or only a few users in the system.                                                    member…]
                                                                                                                          already exists.
                                                                                                                          Get the number of
      REDIS DATA STRUCTURES AND COMMANDS FOR                                                    ZCARD key
                                                                                                                          members in a sorted set.
      RECOMMENDATIONS
                                                                                                                          Count the members in
     Redis data structures can tremendously reduce application                                  ZCOUNT key min max        a sorted set with scores
     complexity while delivering very high performance at high scale.                                                     within the given values.
     Recommendation solutions often need Set operations such as                                 ZINCRBY key               Increment the score of a
     intersection, union, and set difference to be executed very quickly.                       increment member          member in a sorted set.
     Redis data structures such as Strings, Sets, and Sorted Sets                                                         Intersect multiple sorted
                                                                                                ZINTERSTORE
     come in handy while implementing a recommendation solution.                                destination numkeys       sets and store the
     Also, being an in-memory database platform, Redis delivers very                            key [key …]               resulting set in a key.
     high performance with sub-millisecond latency with very little                                                       Return a range of
     computational resources.                                                                   ZRANGE key start
                                                                                                                          members in a sorted set
                                                                                                stop [WITHSCORES]
                                                                                                                          by index.
     Before we begin setting up the recommendation system, let’s
                                                                                                ZRANGEBYSCORE             Return a range of
     get acquainted with some of the Redis data structures and their                            key min max               members in a sorted set
     commands:                                                                                  [WITHSCORES]              by scores.
                                                                                                                          Determine the index of a
         DATA                                                                                   ZRANK key member
                            COMMAND                    DESCRIPTION                                                        member in a sorted set.
      STRUCTURE
                                                                                                ZREM key member           Remove one or more
      Strings        GET key                    Get the value of the key.                       [member …]                members from a sorted set.
                                                                                                                          Return a range of
                                                Set key to hold the string                      ZREVRANGE key
                     SET key value                                                                                        members in a sorted
                                                value.                                          start stop
                                                                                                                          set by index with scores
                                                                                                [WITHSCORES]
                     SADD key member            Add one or more                                                           ordered from high to low.
      Sets
                     [member …]                 members to a set.                                                         Return a range of
                                                                                                ZREVRANGEBYSCORE
                                                                                                                          members in a sorted set
                                                Get the number of                               key max min
                     SCARD key                                                                                            by score, with scores
                                                members in a set.                               [WITHSCORES]
                                                                                                                          ordered from high to low.
                     SDIFF key [key …]           Subtract multiple sets.
                                                                                 Continued on next page
          DATA
                                                                                  they are interested in. The backend of the application tracks all
                             COMMAND                     DESCRIPTION              the products on sale for each category. When a customer walks
       STRUCTURE
                                                   Determine the index of a       into the store and opens the application, that customer will
                      ZREVRANK key                 member in a sorted set         receive personalized, targeted coupons. The data structures are
                      member                       with scores ordered from       shown in this example:
                                                   high to low.
                                                   Get the score associated         categories = {organic, dairy,… }
                      ZSCORE key member            with the given member in         category:organic:items = {milk, carrots, tomatoes, …}
                                                   a sorted set.                    category:dairy:items = {milk, butter, cheese, …}
                                                                                    user:U1:categories = {organic, dairy}
                      ZUNIONSTORE         Add multiple sorted sets                  user:U2:categories = {dairy}
                      destination numkeys and store the resulting
                      key [key …]         set in a new key.
                                                                                  When user U1 opens her application, she will receive promotions
                                                   Incrementally iterate
                                                                                  related to the following items:
                      ZSCAN key cursor             sorted sets, elements,
                                                   and associated scores.
                                                                                    SUNION category:organic:items category:dairy:items
                                                                                    = {milk, carrots, tomatoes, butter, cheese, …}
     STEP 5                                                                       2.	 For each candidate, calculate a score using the Root Mean Square
     Get all the items that belong to the categories in which the user is             (RMS) of the difference between their mutual item ratings.
     interested.
                                                                                  3.	 Store the top similar users for each individual user.
     The grocery store will recommend carrots to U1.                            STEP 3 Calculate similarity for each candidate
                                                                                Find the difference between <user id> and others in the list. This
      COLLABORATIVE FILTERING BASED ON USER-                                    example uses ZMEMBERS assuming a small dataset. Use ZSCAN
      ITEM ASSOCIATIONS AND THEIR RATINGS                                       when working with a large dataset.
       1.	 Find all users that rated at least one (or N) common item(s)         STEP 4 Getting the candidate items
           as the user, and use them as candidates.                             Now that we have a sorted set of users similar to U1, we can
     extract the items associated with those users and their ratings.           The item that has the highest score is recommended to U1. In our
     We’ll do this with ZUNIONSTORE with all U1’s similar users, but            example, the store recommends carrots to U1.
     then we need to make sure we exclude all the items U1 has
     already rated.                                                              ADVANCED RECOMMENDATIONS
     We’ll use weights again, this time with the AGGREGATE option               Collaborative filtering is a good technique when you have a large
     and ZRANGEBYSCORE command. Multiplying U1’s items by -1 and                dataset pertaining to user behavior. Collaborative filtering is
     all the others by 1, and specifying the AGGREGATE MIN option will          generic, and doesn’t dig into the content of the item being
     yield a sorted set that is easy to cut: All U1’s item scores will be       recommended. This technique works fine when many users share
     negative, while the other user’s item scores will be positive. With        common interests. Content-based recommendations, on the other
     ZRANGEBYSCORE, we can fetch the items with a score greater                 hand, are tedious. They are most effective when incorporating
     than 0, returning only those items that U1 has not rated.                  predictive analytics and machine learning techniques. Redis-ML
                                                                                offers categorizing techniques using tree ensembles such as
     Assuming <user id 1> with similar users <user id 3>, <user id 5>,          Random Forest.
     <user id 6>:
                                                                                The pseudo-code below illustrates how we can use the Redis-
       ZUNIONSTORE recommendations:<user id 1> 4 user:<user id                  ML module for recommendations. The code assumes you have
       1>:items user:<user id 3>:items user:<user id 5>:items
                                                                                already generated a model on Apache Spark and loaded the
       user:<user id 6>:items WEIGHTS -1 1 1 1 AGGREGATE MIN
                                                                                model into Redis. Apache Spark provides you the necessary
     SAMPLE SCENARIO: The local grocery store’s mobile app                      tools to create and train a Machine Learning (ML) module.
     for recommendations                                                        When you load an Apache Spark ML model into Redis, Redis-
                                                                                ML automatically translates the Spark ML model into Redis data
     The grocery chain now decides to add yet another feature within            structures and makes it available for serving immediately.
     its application. It allows the customers to rate the items on a
                                                                                The idea in the code is to:
     scale from 1 to 5. The customers who purchase similar items and
     rate them in a similar fashion would be more closely related as              1.	 Get the user profile from Redis.
     the store starts promoting items based not just based on their
     purchasing behavior, but also on how they rate those items.                  2.	 Fetch the user’s interest categories. We can allow the user
                                                                                      to select the categories they are interested in, or compute
     The data structures would look like:                                             the categories based on their purchase history, or both.
       userid:U1:items = {(milk, 4), (bananas, 5)}                                3.	 Retrieve all the items that belong to the interested
       userid:U2:items = {(milk, 3), (carrots, 4), (bananas, 5)}
                                                                                      categories.
       userid:U3:items = {(milk, 5)}
       item:milk:scores = {(U1, 4), (U2, 3), (U3, 5)}
       item:bananas:scores = {(U1, 5), (U2, 5)}                                   4.	 For each item, calculate the score in the Random Forest
       item:carrots:scores = {(U2, 4)}                                                classifier (RedisRandomForestClassfy).
       ZRANGE user:U1:items 0 -1                                                  5.	 Sort the items based on the rating, and recommend the
       = {(milk, 4), (bananas, 5)}
                                                                                      highest item with the highest rating.
       ZUNIONSTORE user:U1:same_items 2 item:milk:scores
       item:bananas:scores
                                                                                  void setRecommendationsByInterests(String userid){
       user:U1:same_items = {(U1, 9), (U2, 8), (U3, 5)}
                                                                                  	       // Sample data: “age:31”, “sex:male”,
                                                                                  	       “food_1:pizza”, “food_2:sriracha”
       ZINTERSTORE rms:U1:U2 2 user:U1:items user:U2:items
                                                                                  	       String[] featureVector = redis.call(“hget”,
       WEIGHTS 1 -1
       ZINTERSTORE rms:U1:U3 2 user:U1:items user:U3:items                        	       userid+”:profile”);
       WEIGHTS 1 -1
       rms:U1:U2 = {(bananas, 0), (milk, 1)};                                     	
       rms:U1:U3 = {(milk, -1)};                                                  	       Category[] userInterestCategories = redis.
       RMS of rms:U1:U2 = 0.7                                                     	call(“smembers”,
       RMS of rms:U1:U3 = 1                                                       	“interest_categories:”+userid);
                                                                                  	
                                                                                  	       // For each category we have a machine learning
     From the above calculation, we can conclude that U2 is closer to
                                                                                  	       model that will recommend
     U1, than U3 is to U1. However, for our calculations, we will choose
                                                                                  	       // the most suitable items according to the users
     RMS values less than or equal to 1. Therefore, we will consider the          	       feature vector.
     ratings of both U2 and U3.                                                   	       // The models are trained on Spark and stored on
                                                                                  	       Redis ML.
       ZUNIONSTORE recommendations:U1 3 user:U1:items                             	       for(category in userInterestCategories){
       user:U2:items user:U3:items WEIGHTS -1 1 1        AGGREGATE MIN            		                // Get all items of this category
       recommendations:U1 = {(bananas, -5), (milk, -4),                           		                String[] items = redis.call(“smembers”,
       (carrots, 4)}
                                                                                Code continued on next page
                              A B O U T T H E AU T H O R
                              ROSHAN KUMAR is a Senior Product Marketing Manager at Redis Labs, Inc. He has extensive
                              experience in software development and marketing in the technology sector. In the past, Roshan has
                              worked at Hewlett-Packard, and many successful Silicon Valley startups – ZillionTV, Salorix, Alopa, and
                              ActiveVideo to name a few. As an enthusiastic programmer, he designed and developed mindzeal.
                              com, an online platform hosting computer programming courses for young students. Roshan holds a
                              Bachelor’s degree in Computer Science, and MBA from Santa Clara University, California, USA.
DZone communities deliver over 6 million pages each month to more than 3.3 million
software developers, architects and decision makers. DZone offers something for
everyone, including news, tutorials, cheat sheets, research guides, feature articles,
source code and more.
                                                                                                                                                     DZONE, INC.         REFCARDZ FEEDBACK
"DZone is a developer's dream," says PC Magazine.                                                                                    150 PRESTON EXECUTIVE DR.                     WELCOME
                                                                                                                                                                         refcardz@dzone.com
                                                                                                                                                CARY, NC 27513
Copyright © 2017 DZone, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval                                                          SPONSORSHIP
system, or transmitted, in any form or by means electronic, mechanical, photocopying, or otherwise, without prior                                888.678.0399                 OPPORTUNITIES
written permission of the publisher.                                                                                                             919.678.0300               sales@dzone.com