TL;DR
What is Max?Max is a data exploration tool that uses natural conversation to quickly provide answers to questions about business performance data. It parses data faster than humans and turns raw data into insightful stories. Max provides customized responses and insights tailored to the specific needs and characteristics of your business.What Data Can Max Analyze?
- Max is best suited for analyzing financial, market, and operational metrics to measure, evaluate, and manage the effectiveness and efficiency of a company’s operations.
- Max analyzes KPIs to answer questions about the overall health of the business, provide opportunities or corrections, identify changes of key business indicators, help achieve a goal, and more.
- Your business data can be provided to Max in the form of a CSV file or through a connection to Redshift, Microsoft Azure, Google BigQuery, or Databricks.
What can Max do?
Max allows you to explore your data through natural conversation. Much like the powers of OpenAI’s Chat GPT, but specific to your data and use case, Max can quickly provide answers to questions about anything pertaining to your data.
Max understands your business and data, parses data faster than any human, and turns your raw data into insightful stories.
Max uses your data to provide customized responses and insights that are tailored to your specific needs and characteristics. For example, if the dataset contained information about market performance for the bottled water industry, Max could provide information about which brands are performing best in each subcategory, how each brand is trending over time, the drivers behind growth or decline of each metric, and opportunities for each brand within the market space.
Overall, Max provides valuable insights and recommendations that are tailored to your unique needs and circumstances.
What type of data should I use with Max?
Max works best with analyzing business performance datasets. Business performance data refers to the collection of quantitative and qualitative information that reflects how well a company is performing in various areas of its operations. This data is used to measure, evaluate, and manage the effectiveness and efficiency of a business, and helps in making informed decisions to drive growth, profitability, and sustainability.
Business performance data can encompass a wide range of metrics, including but not limited to:
- Financial Metrics: These include revenue, net income, gross profit margin, return on investment (ROI), and other financial ratios that indicate a company's profitability and financial health.
- Market Metrics: These evaluate the company's position in its industry and its competitive landscape, such as market share, growth rate, and brand recognition.
- Operational Metrics: These measure the efficiency and effectiveness of a company's core business processes, such as production, inventory management, and customer service. Examples include throughput, lead time, order-to-delivery time, and on-time delivery rates.
- Customer Metrics*: These reflect the company's ability to satisfy and retain customers, such as customer satisfaction scores, customer retention rates, customer lifetime value, Net Promoter Score (NPS), and cohort analysis.
-
Human Resource Metrics*: These provide insights into the company's workforce and how well it is managed, including employee turnover, absenteeism, productivity, and employee satisfaction.
*Metrics marked with an * indicate that these are not currently ideal for use in Max but may be suited for analysis in the future.
Business performance data is often tracked and analyzed using Key Performance Indicators (KPIs), which are specific, measurable, and relevant metrics that help businesses focus on critical aspects of their performance. Max will use this business performance data to answer questions about the overall health of the business, provide opportunities or corrections, identify changes of key business indicators, help achieve a goal, and more.
Please note that Max currently only supports a single fact table.
What makes the ideal dataset?
For the ideal analysis experience with Max, here are some things to consider about your dataset.
Please note that Max will not function without a date column, at least one metric column, and clear column headers.
The More Data, the Better
The more data you have, the more value you will get from Max. We recommend having the following present in your dataset:
- At least 2 years of historical data. Having 2 plus years of data will provide much more accurate and valuable predictions for the future when Max.ai is forecasting metrics.
- The more metrics, the higher the value of analysis. Having several metrics in your data will add value to your analysis by allowing you to get a broader view of your data and compare performance across different metrics.
- More dimensions for additional viewpoints. Additional dimensions in your dataset will provide more variety for breaking analysis out and seeing different views. If you were to upload a dataset of “Sales by State” most of the information that Max might provide would be simplistic insights that you may have been able to ascertain yourself. However, if you upload a dataset of “Sales by State, Store Associate, and Product Category” your insights will be far more valuable and robust and might provide information that you wouldn’t have otherwise discovered.
Data Structure
While column headers being present is all that is required to upload your data, headers that clearly explain the data within that column will make it easier for Max to provide value. Since Max.ai is not doing any data mapping, all data definitions will come from your column headers. Keep this in mind when evaluating the complexity of your data structure.
Examples of possible data structures from best to worst -
- Ideal: Category > subcategory > product would be a simple hierarchy
- Workable: Category > subcategory OR segment > product
- Poor: Category > subcategory OR segment > product, where product can belong to multiple subcategories or segments
Qualities of Ideal Data
The ideal dataset is defined by being consistent, granular, compatible, complete, and unique.
Consistent: The data needs to be consistent in format, structure, and labeling. For example, if you're aggregating sales data from different stores, all the stores should have the same types of sales data recorded in the same format.
In the below example we can see that Order ID, Order Date, and Customer ID are all recorded in consistent formats making it easy for Max to analyze the data without having to convert any items to match one another.
Versus in this example, we have two different stores recording their Order IDs, Customer IDs, and States in different formats. Order IDs are still unique and will therefore have no impact on Max's analysis. If a customer shops at both stores, they would have two unique customer IDs, one in each format, that Max would not be able to reconcile as the same customer since the formatting of the customer ID is inconsistent. Max would not be able to look at aggregated metrics by state since each store is recording state in a different format. (For example, Max would not combine sales from TX and Texas as being in the same state.)
Granular: The data needs to be at the same level of granularity. For example, if you're aggregating sales data from different stores, all the stores should have the same level of detail recorded, such as daily or weekly sales totals. Max wants the most granular data that you have without you also providing an aggregate of those numbers. For instance, if you have city level data, do not also provide state level or country level. The granularity should be consistent across all the metrics in the dataset. So only go as granular as the least granular level that each metric has.
In this example we see that every sale is broken down at the Order ID level, even if each customer has multiple orders. This allows Max to analyze trends within customer spending and look at the items, subcategories, and categories that each customer is buying from.
Here, sales are only as granular as the customer level. If a customer has multiple orders in their lifetime, they are totaled up by customer and Max cannot analyze anything at the individual order level.
Please note that you should not include total rows (for example, weekly sales if you have daily sales data) as Max will calculate the totals as needed and duplicative counting may occur if totals or subtotals are included in the data.
Compatible: The data needs to be in a format that can be easily combined. For example, if you're aggregating sales data from different stores, they should all be in the same format, currency, etc.
In the below data, we see that all sales data is reportedly neatly and in the same numerical format and currency.
Versus here, sales are being reported in several formats and multiple currencies making it difficult for Max to provide complete or accurate analysis of sales overall.
Complete: The data needs to be complete. If there are missing values or incomplete records, it may be difficult to aggregate the data accurately. For instance if you have products, subcategories, and categories, but subcategories aren’t provided for each product, then Max will have limited accuracy at the subcategory level without all of the data being present.
Below we see an example of what ideal data would look like. All fields are complete for each data entry making it easy for Max to interpret and analyze the data.
In this dataset however, there are many missing entries including missing Order IDs, Product IDs, categories, and subcategories. This would make it difficult for Max to perform any sort of analysis about how products, categories, or subcategories are performing.
Unique: The data needs to be uniquely identifiable. For example, if you're aggregating customer data from different sources, each customer should have a unique identifier such as a customer ID number or email address.
In the example shown here, every order has a unique Order ID and each customer is assigned a unique Customer ID. This prevents confusion or overlap if you have multiple customers with the same name for instance.
Updated