Class AdvancedAnalysisService

java.lang.Object
com.leumanuel.woozydata.service.AdvancedAnalysisService

public class AdvancedAnalysisService extends Object
Service class that provides advanced statistical analysis capabilities for DataFrames. This class includes methods for time series analysis, outlier detection, missing value analysis, and comprehensive statistical reporting.
  • Constructor Details

    • AdvancedAnalysisService

      public AdvancedAnalysisService()
  • Method Details

    • timeAnalysis

      public DataFrame timeAnalysis(DataFrame df, String dateCol, String valueCol)
      Performs time series analysis on a DataFrame with date and value columns.
      Parameters:
      df - The DataFrame containing the time series data
      dateCol - The name of the column containing date/time values
      valueCol - The name of the column containing numeric values to analyze
      Returns:
      DataFrame containing time series analysis results including trends, autocorrelation, and basic statistics
    • outlierAnalysis

      public DataFrame outlierAnalysis(DataFrame df, String... columns)
      Analyzes outliers in specified numeric columns using both Z-score and IQR methods.
      Parameters:
      df - The DataFrame to analyze
      columns - The column names to check for outliers
      Returns:
      DataFrame containing outlier analysis results for each column including outlier counts, bounds, and detailed outlier information
    • missingAnalysis

      public DataFrame missingAnalysis(DataFrame df)
      Analyzes missing values in all columns of the DataFrame.
      Parameters:
      df - The DataFrame to analyze
      Returns:
      DataFrame containing missing value analysis results including counts and percentages for each column
    • fullReport

      public Map<String,Object> fullReport(DataFrame df, String... columns)
      Generates a comprehensive statistical report for specified columns.
      Parameters:
      df - The DataFrame to analyze
      columns - The columns to include in the report (if empty, all columns are analyzed)
      Returns:
      Map containing detailed statistical analysis including basic statistics, missing values, outliers, correlations, and distribution analysis