initial commit for predictor module

2025-04-16 16:50:59 -04:00 · 2025-04-16 16:50:59 -04:00 · f598141500
commit f598141500
166 changed files with 6672 additions and 0 deletions
--- a/gru_sac_predictor/README.md
+++ b/gru_sac_predictor/README.md
@ -0,0 +1,141 @@
+# v7 - GRU + Simplified SAC Trading Agent (V6 GRU Adaptation)
+
+This project implements a cryptocurrency trading system using a GRU model for price prediction and a **Simplified SAC (Soft Actor-Critic)** agent for position sizing.
+
+The system predicts future *price* using a GRU model adapted from the V6 architecture. It calculates the *predicted percentage return* from this price prediction and estimates prediction *uncertainty* based on the standard deviation of Monte Carlo dropout predictions. These two values (`predicted_return`, `mc_unscaled_std_dev`) form the state input to the SAC reinforcement learning agent, which determines optimal position sizing (-1 to +1).
+
+The system incorporates efficiency improvements by pre-computing GRU predictions and uncertainties before generating SAC experiences or running the backtest. It includes detailed backtesting, performance reporting, and visualization capabilities.
+
+## System Design
+
+The system integrates a GRU predictor and a Simplified SAC agent within a backtesting framework.
+
+### 1. Data Flow & Processing
+
+1.  **Loading:** Raw 1-minute OHLCV data is loaded from the SQLite database directory specified in `main.py` (e.g., `downloaded_data/`) using `src.data_pipeline.load_data_from_db` which utilizes `src.crypto_db_fetcher.CryptoDBFetcher`.
+2.  **Splitting:** Data is chronologically split into training (60%), validation (20%), and test (20%) sets using `src.data_pipeline.create_data_pipeline`.
+3.  **GRU Training / Loading (on Train/Validation Sets):**
+    *   If `TRAIN_GRU_MODEL` is `True`:
+        *   *Preprocessing*: `TradingSystem._preprocess_data_for_gru_training` calculates V6 features plus basic return features (`calculate_v6_features`) on the raw train/val data. It determines the future *price* target (`prediction_horizon` steps ahead) and aligns features, targets (prices), and the *unscaled* starting close prices needed for return calculation.
+        *   *Scaling*: Within `TradingSystem.train_gru`, a `StandardScaler` is fitted *only* on the training features. A `MinMaxScaler` is fitted *only* on the training future *price* targets. Train and validation features/targets are scaled using these fitted scalers.
+        *   *Sequence Creation*: `src.data_pipeline.create_sequences_v2` creates input sequences `(batch, sequence_length, num_features)` and corresponding scaled target prices using the scaled features/targets and the unscaled start prices.
+        *   *Model Training*: `CryptoGRUModel.train` builds the V6-style GRU model (if not already built) and trains it using Mean Squared Error (MSE) loss on the scaled sequences. Callbacks monitor `val_rmse` for early stopping and model checkpointing. The best model (`best_model_reg.keras`) and the fitted scalers (`feature_scaler.joblib`, `y_scaler.joblib`) are saved.
+    *   If `LOAD_EXISTING_SYSTEM` is `True` and `TRAIN_GRU_MODEL` is `False`: Attempts to load a pre-trained GRU model and scalers. If `GRU_MODEL_LOAD_RUN_ID` is set in `main.py`, it loads from that specific run ID's directory (`v7/models/run_<run_id>`); otherwise, it attempts to load from the default `MODEL_SAVE_PATH` (expecting a `gru_model` subdirectory).
+4.  **SAC Training (on Validation Set):**
+    *   **Training Loop:** The training process runs for a fixed number of agent update steps (`TOTAL_TRAINING_STEPS`) instead of epochs.
+    *   **Experience Generation** (`TradingSystem.generate_trading_experiences`):
+        *   **Efficiency:** Pre-computes all required GRU outputs (predicted returns, uncertainties) for the entire validation set by calling `CryptoGRUModel.evaluate` *once*.
+        *   **Initial Fill:** Generates an initial set of experiences (`experience_config['initial_experiences']`). Uses the sampling strategy.
+        *   **Sampling (`_sample_experience_indices`):** When generating a specific number of experiences (initial fill or periodic updates), it applies **weighted sampling** (controlled by `recency_bias_strength`) and **stratified sampling** (ensuring minimum ratios `min_uncertainty_ratio`, `min_extreme_return_ratio` of high uncertainty/extreme return examples based on quantiles `high_uncertainty_quantile`, `extreme_return_quantile`) based on parameters in `experience_config`.
+        *   **Experience Format:** Iterates through the (potentially sampled) pre-computed results. Forms the state `s_t = [predicted_return_t, uncertainty_t]`. The SAC agent (`SimplifiedSACTradingAgent.get_action`) provides a *non-deterministic* action `a_t`. The next state `s_{t+1}` is retrieved. A reward `r_t = action * actual_return` is calculated (transaction costs are currently ignored in reward calculation during generation for simplicity). The transition `(s_t, a_t, r_t, s_{t+1}, done=False)` is created.
+        *   **Periodic Updates:** During the main training loop (controlled by `total_training_steps`), new batches of experiences (`experience_config['experiences_per_batch']`) are generated periodically (every `experience_config['batch_generation_interval']` loop steps) using the sampling strategy and added to the replay buffer.
+    *   **Agent Training** (`SimplifiedSACTradingAgent.train`): In each step of the main training loop, the agent performs `experience_config['training_iterations_per_step']` update(s). Batches are sampled from the replay buffer. Actor and Critic networks are updated using the SAC algorithm. The agent uses a standard FIFO circular buffer for experience storage.
+5.  **Backtesting (on Test Set):**
+    *   *Pre-computation* (`ExtendedBacktester.backtest`): Similar to SAC training, preprocesses the test data, scales it, creates sequences, and calls `CryptoGRUModel.evaluate` *once* to get all predicted returns and uncertainties for the test set.
+    *   *Iteration*: Steps chronologically through the pre-computed results.
+    *   *State Generation*: Retrieves `predicted_return` and `uncertainty_sigma` from the pre-computed arrays to form the state `s_t`.
+    *   *Action Selection*: The trained `SimplifiedSACTradingAgent` selects a *deterministic* action `a_t`.
+    *   *Portfolio Simulation*: Calculates PnL based on the previous position held (`current_position`), the actual return over the step, and subtracts transaction costs based on the change in position (`abs(action - current_position)`).
+    *   *Logging*: Records detailed metrics, trade history, and timestamps.
+6.  **Evaluation:**
+    *   *Performance Metrics*: `ExtendedBacktester._calculate_performance_metrics` computes overall portfolio metrics (Sharpe, Sortino, Drawdown, correlations, etc.) and Buy & Hold benchmark metrics.
+    *   *Visualization*: `ExtendedBacktester.plot_results` generates a 3-panel plot: GRU Predictions vs Actual Price (with uncertainty), SAC Actions (Position Size), and Portfolio Value vs Buy & Hold (with trade markers).
+    *   *Reporting*: `ExtendedBacktester.generate_performance_report` creates a detailed Markdown report.
+
+### 2. Core Components & Inputs/Outputs
+
+*   **`src.crypto_db_fetcher.CryptoDBFetcher`**: Loads and resamples data from SQLite DBs.
+*   **`src.data_pipeline`**: Functions for DB loading, data splitting, sequence creation.
+*   **`src.trading_system.calculate_v6_features`**: Calculates features (TA-Lib based V6 set + past returns).
+*   **`src.trading_system._preprocess_data_for_gru_training`**: Prepares features, future price targets, and start prices.
+*   **`src.gru_predictor.CryptoGRUModel`**: (V6 Adaptation)
+    *   `train()`: Trains the GRU price prediction model. Saves model (`.keras`) and scalers (`.joblib`).
+    *   `evaluate()`: Performs standard prediction and MC dropout inference. Returns dict including `pred_percent_change`, `mc_unscaled_std_dev`, `predicted_unscaled_prices`, `true_unscaled_prices`.
+*   **`src.sac_agent_simplified.SimplifiedSACTradingAgent`**: (V7 Simplified)
+    *   **Goal:** Learns a policy mapping state to optimal position size (-1.0 to +1.0). Optimized for faster training.
+    *   **State Input:** 2-element array `[predicted_return, mc_unscaled_std_dev]`.
+    *   **Action Output:** Float between -1.0 and +1.0.
+    *   `get_action()`: Selects action (stochastic or deterministic). Adds uncertainty-scaled noise during exploration.
+    *   `store_transition()`: Adds experience to internal NumPy buffer.
+    *   `train()`: Updates agent using buffer samples (internally handles batch size). Uses `@tf.function` for performance.
+    *   `save()` / `load()`: Handles Actor/Critic weights (`.weights.h5`), potentially `alpha.npy`.
+    *   **Note:** Models and optimizers are built explicitly during `__init__` to prevent TensorFlow graph mode issues.
+*   **`src.trading_system.TradingSystem`**: Integrates GRU and SAC. Manages training pipelines, experience generation (including advanced sampling).
+*   **`src.trading_system.ExtendedBacktester`**: Performs efficient backtesting using pre-computed GRU outputs, calculates metrics, plots results, generates reports.
+
+### 3. Model Architectures
+
+*   **GRU (`src.gru_predictor.CryptoGRUModel._build_model`)**: V6 Architecture.
+    *   Input -> GRU(100) -> Dropout(0.2) -> Dense(1, linear).
+    *   Compiled with Adam (LR=0.001), MSE loss.
+*   **Simplified SAC (`src.sac_agent_simplified.SimplifiedSACTradingAgent`)**:
+    *   **Actor Network**: MLP `(state_dim=2)` -> Dense(64, relu) -> [BN] -> Dense(64, relu) -> [BN] -> [Residual] -> Dense(1, tanh).
+    *   **Critic Network (x2)**: MLP `(state_dim=2 + action_dim=1)` -> Dense(64, relu) -> [BN] -> Dense(64, relu) -> [BN] -> [Residual] -> Dense(1, linear).
+    *   **Algorithm**: Implements SAC with Clipped Double-Q, fixed alpha (tunable via `SAC_ALPHA`), faster learning rates, smaller networks/buffer, optional Batch Normalization / Residual connections. Uses Huber loss for critics. No distributional critics. `@tf.function` used for update steps.
+
+### 4. Features
+
+The GRU model uses the V6 feature set plus basic past returns:
+*   **TA-Lib Indicators & Derived Indicators:** SMA, EMA, MACD, SAR, ADX, RSI, Stochastics, WILLR, ROC, CCI, BBands, ATR, OBV, CMF, etc. (Requires TA-Lib installation). Fallback calculations for SMA, EMA, RSI if TA-Lib is unavailable.
+*   **Custom Crypto Features:** Parkinson Volatility, Garman-Klass Volatility, VWAP ratios, Volume Intensity, Wick Ratios.
+*   **Past Returns:** `return_1m`, `return_5m`, `return_15m`, `return_60m` (percentage change).
+*   **Scaling:** Features scaled with `StandardScaler` (fitted on train). Target variable (future price) scaled with `MinMaxScaler` (fitted on train).
+
+### 5. Evaluation
+
+*   **GRU Model:** Evaluated using RMSE loss on validation set. Callbacks monitor `val_rmse`. Plots compare predicted vs actual price.
+*   **SAC Agent & Overall System:** Evaluated via the `ExtendedBacktester` metrics (Sharpe, Sortino, Max Drawdown, correlations, etc.), plots (Portfolio vs B&H, Actions), and a final Markdown report.
+
+## File Structure
+
+- `data/`: *Not used by default if loading from DB.*
+- `downloaded_data/`: **Place your V6 SQLite database files here.** (Or update `DB_DIR` in `main.py`).
+- `models/`: Trained models (GRU + scalers, SAC weights) saved here under `run_<run_id>/` directories by default.
+- `results/`: Backtest results (plots, reports, config) saved here under `<run_id>/` directories.
+- `logs/`: Log files saved here under `<run_id>/` directories.
+- `src/`: Core Python modules.
+  - `crypto_db_fetcher.py`: Class for fetching data from SQLite DBs.
+  - `data_pipeline.py`: DB loading function, data splitting, sequence creation.
+  - `gru_predictor.py`: V6-style GRU model for price regression and MC uncertainty.
+  - `sac_agent_simplified.py`: Simplified SAC agent implementation (V7.5+).
+  - `sac_agent.py`: Original SAC agent implementation (potentially outdated).
+  - `trading_system.py`: Integration class, feature calculation, scaling, experience generation, `ExtendedBacktester` class.
+- `main.py`: Main script using DB loading, orchestrates training and backtesting.
+- `requirements.txt`: Dependencies.
+- `v7_instructions.txt`: Design notes for Simplified SAC.
+- `experience_instructions.txt`: Design notes for experience generation.
+- `README.md`: This file.
+
+## Setup
+
+1.  **Data:** Place your V6 `downloaded_data` directory containing the SQLite files relative to the `v7` project root, or update the `DB_DIR` variable in `main.py` to point to the correct location.
+2.  **Dependencies:** Install required packages:
+    ```bash
+    pip install -r requirements.txt
+    ```
+    *Strongly Recommended:* Install TA-Lib for the full feature set. See TA-Lib installation guides for your OS.
+3.  **Configuration:** Review and adjust parameters in `main.py`. Key parameters include:
+    *   `DB_DIR`, `TICKER`, `EXCHANGE`, `START_DATE`, `END_DATE`, `INTERVAL`
+    *   Model hyperparameters (GRU and SAC sections)
+    *   Control Flags: `LOAD_EXISTING_SYSTEM`, `TRAIN_GRU_MODEL`, `TRAIN_SAC_AGENT`, `LOAD_SAC_AGENT`
+    *   Loading Specific Models: `GRU_MODEL_LOAD_RUN_ID` (set to a specific run ID string like `'YYYYMMDD_HHMMSS'` to load that GRU model from `v7/models/run_<run_id>/`). Note: This expects GRU and SAC files to be in the *same* directory if loading this way.
+    *   SAC Training: `TOTAL_TRAINING_STEPS` defines the length of SAC training (number of agent `train()` calls).
+    *   Experience Generation: `experience_config` dictionary controls initial fill, periodic updates, and sampling strategies (recency bias, stratification for uncertainty/extreme returns).
+    *   Backtesting: `INITIAL_CAPITAL`, `TRANSACTION_COST`.
+4.  **Run:** Execute from the project root directory (containing the `v7` folder):
+    ```bash
+    python -m v7.main
+    ```
+    Output files (logs, models, plots, report) will be generated in `v7/logs/`, `v7/models/`, and `v7/results/` within run-specific subdirectories.
+
+## Reporting
+
+The report generated by the `ExtendedBacktester` includes performance metrics, correlation analyses, and configuration details. Key metrics include:
+
+*   Total/Annualized Return
+*   Sharpe & Sortino Ratios
+*   Volatility & Max Drawdown
+*   Buy & Hold Comparison
+*   Position/Prediction Accuracy
+*   Prediction/Position/Uncertainty Correlations
+*   Total Trades 
--- a/gru_sac_predictor/pycache/main.cpython-312.pyc
+++ b/gru_sac_predictor/pycache/main.cpython-312.pyc
--- a/gru_sac_predictor/logs/20250416_142744/main_v7_20250416_142744.log
+++ b/gru_sac_predictor/logs/20250416_142744/main_v7_20250416_142744.log
--- a/gru_sac_predictor/logs/20250416_144232/main_v7_20250416_144232.log
+++ b/gru_sac_predictor/logs/20250416_144232/main_v7_20250416_144232.log
--- a/gru_sac_predictor/logs/20250416_144418/main_v7_20250416_144418.log
+++ b/gru_sac_predictor/logs/20250416_144418/main_v7_20250416_144418.log
--- a/gru_sac_predictor/logs/20250416_144645/main_v7_20250416_144645.log
+++ b/gru_sac_predictor/logs/20250416_144645/main_v7_20250416_144645.log
--- a/gru_sac_predictor/logs/20250416_144757/main_v7_20250416_144757.log
+++ b/gru_sac_predictor/logs/20250416_144757/main_v7_20250416_144757.log
--- a/gru_sac_predictor/logs/20250416_144847/main_v7_20250416_144847.log
+++ b/gru_sac_predictor/logs/20250416_144847/main_v7_20250416_144847.log
--- a/gru_sac_predictor/logs/20250416_145035/main_v7_20250416_145035.log
+++ b/gru_sac_predictor/logs/20250416_145035/main_v7_20250416_145035.log
--- a/gru_sac_predictor/logs/20250416_145128/main_v7_20250416_145128.log
+++ b/gru_sac_predictor/logs/20250416_145128/main_v7_20250416_145128.log
--- a/gru_sac_predictor/logs/20250416_150616/main_v7_20250416_150616.log
+++ b/gru_sac_predictor/logs/20250416_150616/main_v7_20250416_150616.log
--- a/gru_sac_predictor/logs/20250416_150829/main_v7_20250416_150829.log
+++ b/gru_sac_predictor/logs/20250416_150829/main_v7_20250416_150829.log
--- a/gru_sac_predictor/logs/20250416_150924/main_v7_20250416_150924.log
+++ b/gru_sac_predictor/logs/20250416_150924/main_v7_20250416_150924.log
--- a/gru_sac_predictor/logs/20250416_151322/main_v7_20250416_151322.log
+++ b/gru_sac_predictor/logs/20250416_151322/main_v7_20250416_151322.log
--- a/gru_sac_predictor/logs/20250416_151849/main_v7_20250416_151849.log
+++ b/gru_sac_predictor/logs/20250416_151849/main_v7_20250416_151849.log
--- a/gru_sac_predictor/logs/20250416_152415/main_v7_20250416_152415.log
+++ b/gru_sac_predictor/logs/20250416_152415/main_v7_20250416_152415.log
--- a/gru_sac_predictor/logs/20250416_153132/main_v7_20250416_153132.log
+++ b/gru_sac_predictor/logs/20250416_153132/main_v7_20250416_153132.log
--- a/gru_sac_predictor/logs/20250416_153846/main_v7_20250416_153846.log
+++ b/gru_sac_predictor/logs/20250416_153846/main_v7_20250416_153846.log
--- a/gru_sac_predictor/logs/20250416_154636/main_v7_20250416_154636.log
+++ b/gru_sac_predictor/logs/20250416_154636/main_v7_20250416_154636.log
--- a/gru_sac_predictor/logs/20250416_162528/main_v7_20250416_162528.log
+++ b/gru_sac_predictor/logs/20250416_162528/main_v7_20250416_162528.log
--- a/gru_sac_predictor/logs/20250416_162624/main_v7_20250416_162624.log
+++ b/gru_sac_predictor/logs/20250416_162624/main_v7_20250416_162624.log
--- a/gru_sac_predictor/logs/20250416_162718/main_v7_20250416_162718.log
+++ b/gru_sac_predictor/logs/20250416_162718/main_v7_20250416_162718.log
--- a/gru_sac_predictor/logs/20250416_162921/main_v7_20250416_162921.log
+++ b/gru_sac_predictor/logs/20250416_162921/main_v7_20250416_162921.log
--- a/gru_sac_predictor/logs/20250416_163030/main_v7_20250416_163030.log
+++ b/gru_sac_predictor/logs/20250416_163030/main_v7_20250416_163030.log
--- a/gru_sac_predictor/logs/20250416_163440/main_v7_20250416_163440.log
+++ b/gru_sac_predictor/logs/20250416_163440/main_v7_20250416_163440.log
--- a/gru_sac_predictor/logs/20250416_164324/main_20250416_164324.log
+++ b/gru_sac_predictor/logs/20250416_164324/main_20250416_164324.log
--- a/gru_sac_predictor/logs/20250416_164410/main_20250416_164410.log
+++ b/gru_sac_predictor/logs/20250416_164410/main_20250416_164410.log
--- a/gru_sac_predictor/logs/20250416_164547/main_20250416_164547.log
+++ b/gru_sac_predictor/logs/20250416_164547/main_20250416_164547.log
--- a/gru_sac_predictor/logs/20250416_164726/main_20250416_164726.log
+++ b/gru_sac_predictor/logs/20250416_164726/main_20250416_164726.log
--- a/gru_sac_predictor/logs/main_v7.log
+++ b/gru_sac_predictor/logs/main_v7.log
--- a/gru_sac_predictor/main.py
+++ b/gru_sac_predictor/main.py
@ -0,0 +1,465 @@
+import pandas as pd
+import numpy as np
+import matplotlib.pyplot as plt
+import os
+from datetime import datetime
+import warnings
+import logging
+import sys
+import json
+
+# --- Generate Run ID ---
+run_id = datetime.now().strftime("%Y%m%d_%H%M%S")
+
+# Import components
+# V7 Update: Import load_data_from_db
+from .src.data_pipeline import create_data_pipeline, load_data_from_db
+from .src.trading_system import TradingSystem, ExtendedBacktester, plot_sac_training_history
+# V7.3 Fix: Add missing imports
+# V7-V6 Final Update: Import CryptoGRUModel
+from .src.gru_predictor import CryptoGRUModel 
+# V7.5 Import the simplified agent
+from .src.sac_agent_simplified import SimplifiedSACTradingAgent 
+# GRU and SAC classes are implicitly imported via TradingSystem
+
+# --- Base Output Directories ---
+BASE_RESULTS_DIR = "gru_sac_predictor/results"
+BASE_LOGS_DIR = "gru_sac_predictor/logs"
+BASE_MODELS_DIR = "gru_sac_predictor/models"
+
+# --- Run Specific Directories ---
+RUN_RESULTS_DIR = os.path.join(BASE_RESULTS_DIR, run_id)
+RUN_LOGS_DIR = os.path.join(BASE_LOGS_DIR, run_id)
+RUN_MODELS_DIR = os.path.join(BASE_MODELS_DIR, f"run_{run_id}")
+
+# --- Logging Setup ---
+log_format = '%(asctime)s - %(name)s - %(levelname)s - %(message)s'
+# Ensure logs directory exists
+os.makedirs(RUN_LOGS_DIR, exist_ok=True)
+log_file_path = os.path.join(RUN_LOGS_DIR, f"main_{run_id}.log") # Removed _v7
+logging.basicConfig(
+    level=logging.INFO,
+    format=log_format,
+    handlers=[
+        logging.FileHandler(log_file_path, mode='a'), # Use path variable
+        logging.StreamHandler(sys.stdout)
+    ]
+)
+logger = logging.getLogger(__name__)
+
+# --- Configuration ---
+# V7 Update: Add DB parameters
+DB_DIR = '../downloaded_data' # V7 Fix: Point to correct relative path for V6 data
+TICKER = 'BTC-USD' # Example ticker
+EXCHANGE = 'COINBASE' # Example exchange
+START_DATE = '2025-03-01' # Example start date - NOTE: VERY SHORT!
+END_DATE = '2025-03-10' # Example end date - NOTE: VERY SHORT!
+INTERVAL = '1min' # Data interval to fetch and use
+
+MODEL_SAVE_PATH = RUN_MODELS_DIR # Use run-specific directory
+# Updated paths to use RUN_RESULTS_DIR and include run_id
+RESULTS_PLOT_PATH = os.path.join(RUN_RESULTS_DIR, f'backtest_results_{run_id}.png') # Removed _v7
+REPORT_SAVE_PATH = os.path.join(RUN_RESULTS_DIR, f'backtest_performance_report_{run_id}.md') # Removed _v7
+# GRU_PLOT_PATH = 'gru_performance_v7.png' # Not used directly in main
+
+# V7.6 Add specific run ID for loading GRU model
+GRU_MODEL_LOAD_RUN_ID = '20250416_142744' # Set this to a specific 'YYYYMMDD_HHMMSS' string to load that GRU model
+
+# Data split ratios
+TRAIN_RATIO = 0.6
+VALIDATION_RATIO = 0.2
+
+# Model/Training Parameters (V7.3)
+GRU_LOOKBACK = 60
+GRU_PREDICTION_HORIZON = 1
+GRU_EPOCHS = 20
+GRU_BATCH_SIZE = 32 # Updated default
+GRU_PATIENCE = 10 # Updated default
+GRU_LR_PATIENCE = 10 # Updated default
+GRU_LR_FACTOR = 0.5 # Updated default
+GRU_RETURN_SCALE = 0.03 # Updated default
+
+# SAC Parameters (V7.5 - Simplified Agent)
+SAC_STATE_DIM = 5 # [pred_return, uncertainty, z, momentum_5, volatility_20] - Updated from 2
+SAC_HIDDEN_SIZE = 64
+SAC_GAMMA = 0.97
+SAC_TAU = 0.02
+# SAC_ALPHA = 0.1 # Removed - Will use automatic tuning
+SAC_ACTOR_LR = 3e-4 # Lowered from 5e-4
+SAC_CRITIC_LR = 5e-4 # Lowered from 8e-4
+SAC_BATCH_SIZE = 64
+SAC_BUFFER_MAX_SIZE = 20000
+SAC_MIN_BUFFER_SIZE = 1000
+SAC_UPDATE_INTERVAL = 1
+SAC_TARGET_UPDATE_INTERVAL = 2
+SAC_GRADIENT_CLIP = 1.0
+SAC_REWARD_SCALE = 2.0 # Decreased from 10.0
+SAC_USE_BATCH_NORM = True
+SAC_USE_RESIDUAL = True
+SAC_MODEL_DIR = 'models/simplified_sac' # Default dir within the agent class
+SAC_EPOCHS = 50 # Keep this from previous config for training loop control
+
+# V7.9 Experience Generation Config (Based on instructions.txt)
+# TOTAL_TRAINING_STEPS = 1000 # Removed - Not used in current training loop
+experience_config = {
+    # Basic setup
+    'initial_experiences': 3000,      # Start with this many experiences
+    'experiences_per_batch': 64,      # Generate this many in each new batch
+    'batch_generation_interval': 500, # Generate a new batch every N training steps
+
+    # Distribution control (Flags for future implementation in generate_trading_experiences)
+    'balance_market_regimes': False,    # Not implemented
+    'recency_bias_strength': 0.5,        # 0 = uniform, >0 weights recent data more
+    'high_uncertainty_quantile': 0.75,   # Threshold for high uncertainty
+    'extreme_return_quantile': 0.1,      # Threshold for extreme returns (upper/lower)
+    'min_uncertainty_ratio': 0.2,        # Min % of samples with high uncertainty
+    'min_extreme_return_ratio': 0.1,     # Min % of samples with extreme returns
+
+    # Efficient processing
+    'use_parallel_generation': False, # Not implemented
+    'precompute_all_gru_outputs': True, # Already implemented
+    'buffer_update_strategy': 'fifo', # Agent currently uses FIFO
+
+    # Training optimization
+    'training_iterations_per_step': 1, # Number of agent.train calls per main loop step
+    # Max/Min buffer size are defined by the agent itself now
+}
+
+# Backtesting Parameters
+INITIAL_CAPITAL = 10000.0
+TRANSACTION_COST = 0.0005
+# V7.12 Add Opportunity Cost Penalty Parameters
+OPPORTUNITY_COST_PENALTY_FACTOR = 0.0 # How much to penalize missed high returns - Disabled (was 1.0)
+HIGH_RETURN_THRESHOLD = 0.002       # Actual return magnitude threshold to trigger penalty check
+ACTION_TOLERANCE = 0.3              # Action magnitude below which penalty applies if return threshold met - Lowered from 0.5
+# RISK_PENALTY_FACTOR = 0.0 # Removed as state reverted
+
+# Control Flags
+LOAD_EXISTING_SYSTEM = True
+TRAIN_GRU_MODEL = False
+TRAIN_SAC_AGENT = True # V7.8 Set to True to train SAC
+LOAD_SAC_AGENT = False # V7.8 Set to False to avoid loading SAC
+RUN_BACKTEST = True
+GENERATE_PLOTS = True
+GENERATE_REPORT = True
+# --- End Configuration ---
+
+def main():
+    # Access config variables defined at module level
+    global LOAD_EXISTING_SYSTEM, TRAIN_GRU_MODEL, TRAIN_SAC_AGENT, LOAD_SAC_AGENT
+
+    logger.info(f"--- Starting GRU+SAC Trading System Pipeline (Run ID: {run_id}) ---") # Removed V7
+    
+    # Ensure results directory exists
+    os.makedirs(RUN_RESULTS_DIR, exist_ok=True)
+    # Ensure base models directory exists (RUN_MODELS_DIR created later if training)
+    os.makedirs(BASE_MODELS_DIR, exist_ok=True)
+
+    # LOAD_EXISTING_SYSTEM is now declared global before use here
+    # --- Save Configuration --- 
+    config_to_save = {
+        "run_id": run_id,
+        "db_dir": DB_DIR,
+        "ticker": TICKER,
+        "exchange": EXCHANGE,
+        "start_date": START_DATE,
+        "end_date": END_DATE,
+        "interval": INTERVAL,
+        "model_save_path": MODEL_SAVE_PATH,
+        "results_plot_path": RESULTS_PLOT_PATH,
+        "report_save_path": REPORT_SAVE_PATH,
+        "train_ratio": TRAIN_RATIO,
+        "validation_ratio": VALIDATION_RATIO,
+        "gru_lookback": GRU_LOOKBACK,
+        "gru_prediction_horizon": GRU_PREDICTION_HORIZON,
+        "gru_epochs": GRU_EPOCHS,
+        "gru_batch_size": GRU_BATCH_SIZE,
+        "gru_patience": GRU_PATIENCE,
+        "gru_lr_factor": GRU_LR_FACTOR,
+        "gru_return_scale": GRU_RETURN_SCALE,
+        "gru_model_load_run_id": GRU_MODEL_LOAD_RUN_ID,
+        "sac_state_dim": SAC_STATE_DIM,
+        "sac_hidden_size": SAC_HIDDEN_SIZE,
+        "sac_gamma": SAC_GAMMA,
+        "sac_tau": SAC_TAU,
+        "sac_actor_lr": SAC_ACTOR_LR,
+        "sac_critic_lr": SAC_CRITIC_LR,
+        "sac_batch_size": SAC_BATCH_SIZE,
+        "sac_buffer_max_size": SAC_BUFFER_MAX_SIZE,
+        "sac_min_buffer_size": SAC_MIN_BUFFER_SIZE,
+        "sac_update_interval": SAC_UPDATE_INTERVAL,
+        "sac_target_update_interval": SAC_TARGET_UPDATE_INTERVAL,
+        "sac_gradient_clip": SAC_GRADIENT_CLIP,
+        "sac_reward_scale": SAC_REWARD_SCALE,
+        "sac_use_batch_norm": SAC_USE_BATCH_NORM,
+        "sac_use_residual": SAC_USE_RESIDUAL,
+        "sac_model_dir": SAC_MODEL_DIR,
+        "sac_epochs": SAC_EPOCHS,
+        "experience_config": experience_config,
+        "initial_capital": INITIAL_CAPITAL,
+        "transaction_cost": TRANSACTION_COST,
+        # V7.12 Add new params to saved config
+        "opportunity_cost_penalty_factor": OPPORTUNITY_COST_PENALTY_FACTOR,
+        "high_return_threshold": HIGH_RETURN_THRESHOLD,
+        "action_tolerance": ACTION_TOLERANCE,
+        "load_existing_system": LOAD_EXISTING_SYSTEM,
+        "train_gru_model": TRAIN_GRU_MODEL,
+        "train_sac_agent": TRAIN_SAC_AGENT,
+        "load_sac_agent": LOAD_SAC_AGENT,
+        "run_backtest": RUN_BACKTEST,
+        "generate_plots": GENERATE_PLOTS,
+        "generate_report": GENERATE_REPORT
+    }
+    config_save_path = os.path.join(RUN_RESULTS_DIR, f'config_{run_id}.json')
+    try:
+        with open(config_save_path, 'w') as f:
+            json.dump(config_to_save, f, indent=4)
+        logger.info(f"Run configuration saved to {config_save_path}")
+    except Exception as e:
+        logger.error(f"Failed to save run configuration: {e}")
+    # --- End Save Configuration ---
+
+    # 1. Load Data from Database
+    logger.info(f"Loading data from DB: {TICKER}/{EXCHANGE} ({START_DATE}-{END_DATE}) @ {INTERVAL}")
+    data = load_data_from_db(
+        db_dir=DB_DIR,
+        ticker=TICKER,
+        exchange=EXCHANGE,
+        start_date=START_DATE,
+        end_date=END_DATE,
+        interval=INTERVAL
+    )
+
+    if data.empty:
+        logger.error("Failed to load data from database. Please check DB_DIR and parameters. Aborting.")
+        return
+
+    # --- Re-inserted Steps Start ---
+    # Basic Data Validation (Timestamp index assumed from load_data_from_db)
+    if 'close' not in data.columns: # Check essential columns
+        raise ValueError("Loaded data must contain 'close' column.")
+    logger.info(f"Data loaded: {len(data)} rows, from {data.index.min()} to {data.index.max()}")
+    initial_len = len(data); data.dropna(subset=['open', 'high', 'low', 'close', 'volume'], inplace=True)
+    if len(data) < initial_len: logger.info(f"Dropped {initial_len - len(data)} NaN rows.")
+    if len(data) < GRU_LOOKBACK * 3: raise ValueError(f"Insufficient data ({len(data)} rows) for lookback/splits.")
+
+    # Add cyclical features immediately
+    logger.info("Calculating cyclical time features (hour_sin, hour_cos)...")
+    timestamp_source = None
+    if isinstance(data.index, pd.DatetimeIndex):
+        timestamp_source = data.index
+        logger.debug("Using index for hour features.")
+    elif 'timestamp' in data.columns and pd.api.types.is_datetime64_any_dtype(data['timestamp']):
+        timestamp_source = pd.to_datetime(data['timestamp']) 
+        logger.debug("Using 'timestamp' column for hour features.")
+    elif 'date' in data.columns and pd.api.types.is_datetime64_any_dtype(data['date']):
+         timestamp_source = pd.to_datetime(data['date']) 
+         logger.debug("Using 'date' column for hour features.")
+    
+    if timestamp_source is not None:
+        data['hour_sin'] = np.sin(2 * np.pi * timestamp_source.hour / 24)
+        data['hour_cos'] = np.cos(2 * np.pi * timestamp_source.hour / 24)
+        logger.info("Added hour_sin/hour_cos to main dataframe.")
+    else:
+         logger.warning("Could not find suitable timestamp source. Setting hour_sin/cos defaults (0.0, 1.0).")
+         data['hour_sin'] = 0.0
+         data['hour_cos'] = 1.0 # Default to cos(0) = 1
+
+    # 2. Split Data Chronologically
+    logger.info("Splitting data...")
+    test_ratio = round(1.0 - TRAIN_RATIO - VALIDATION_RATIO, 2)
+    if test_ratio <= 0: raise ValueError("Train+Validation ratios must sum to < 1.")
+    train_data, val_data, test_data = create_data_pipeline(data, [TRAIN_RATIO, VALIDATION_RATIO, test_ratio])
+    if len(train_data) < GRU_LOOKBACK or len(val_data) < GRU_LOOKBACK or len(test_data) < GRU_LOOKBACK:
+         warnings.warn(f"Splits smaller than GRU lookback ({GRU_LOOKBACK}). Backtesting might fail.")
+
+    # 3. Initialize Trading System
+    logger.info("Initializing Trading System...")
+    trading_system = TradingSystem(
+        gru_model=CryptoGRUModel(), # Instantiate the correct model
+        sac_agent=SimplifiedSACTradingAgent(
+            state_dim=SAC_STATE_DIM,
+            hidden_size=SAC_HIDDEN_SIZE,
+            gamma=SAC_GAMMA,
+            tau=SAC_TAU,
+            actor_lr=SAC_ACTOR_LR,
+            critic_lr=SAC_CRITIC_LR,
+            batch_size=SAC_BATCH_SIZE,
+            buffer_max_size=SAC_BUFFER_MAX_SIZE,
+            min_buffer_size=SAC_MIN_BUFFER_SIZE,
+            update_interval=SAC_UPDATE_INTERVAL,
+            target_update_interval=SAC_TARGET_UPDATE_INTERVAL,
+            gradient_clip=SAC_GRADIENT_CLIP,
+            reward_scale=SAC_REWARD_SCALE,
+            use_batch_norm=SAC_USE_BATCH_NORM,
+            use_residual=SAC_USE_RESIDUAL,
+            model_dir=os.path.join(MODEL_SAVE_PATH, 'sac_agent') # Point to subfolder within run
+        ), # Pass the configured agent
+        gru_lookback=GRU_LOOKBACK
+    )
+
+    # --- Model Loading/Training --- 
+    gru_loaded = False; sac_loaded = False
+    if LOAD_EXISTING_SYSTEM:
+        load_base_path = MODEL_SAVE_PATH 
+        logger.info(f"Attempting to load existing system components...")
+        logger.info(f"Base path for loading: {load_base_path}")
+
+        gru_model_load_dir = None
+        sac_model_load_dir = None
+        if GRU_MODEL_LOAD_RUN_ID:
+            gru_model_load_dir = os.path.join(BASE_MODELS_DIR, f'run_{GRU_MODEL_LOAD_RUN_ID}') 
+            logger.info(f"Using specific GRU load path based on run ID: {gru_model_load_dir}")
+            if LOAD_SAC_AGENT:
+                 sac_model_load_dir = os.path.join(BASE_MODELS_DIR, f'run_{GRU_MODEL_LOAD_RUN_ID}')
+                 logger.info(f"Using specific SAC load path based on GRU run ID (LOAD_SAC_AGENT=True): {sac_model_load_dir}")
+            else:
+                 sac_model_load_dir = os.path.join(MODEL_SAVE_PATH, 'sac_agent')
+                 logger.info(f"Defaulting SAC path to current run (LOAD_SAC_AGENT=False): {sac_model_load_dir}")
+        elif os.path.exists(load_base_path):
+            gru_model_load_dir = os.path.join(load_base_path, 'gru_model')
+            sac_model_load_dir = os.path.join(load_base_path, 'sac_agent')
+            logger.info(f"Using GRU load path based on MODEL_SAVE_PATH: {gru_model_load_dir}")
+            logger.info(f"Using SAC load path based on MODEL_SAVE_PATH: {sac_model_load_dir}")
+        else:
+            logger.warning(f"LOAD_EXISTING_SYSTEM is True, but MODEL_SAVE_PATH does not exist: {load_base_path}. Cannot determine model paths.")
+            LOAD_EXISTING_SYSTEM = False
+
+        if LOAD_EXISTING_SYSTEM:
+            try:
+                if gru_model_load_dir and os.path.isdir(gru_model_load_dir):
+                    logger.info(f"Found GRU model directory: {gru_model_load_dir}. Loading...")
+                    if trading_system.gru_model is None: trading_system.gru_model = CryptoGRUModel()
+                    if trading_system.gru_model.load(gru_model_load_dir):
+                        logger.info("GRU model loaded successfully.")
+                        gru_loaded = True
+                        trading_system.feature_scaler = trading_system.gru_model.feature_scaler
+                        trading_system.y_scaler = trading_system.gru_model.y_scaler
+                        logger.info("Scalers propagated from loaded GRU model.")
+                    else: logger.warning(f"GRU model directory found, but loading failed.")
+                elif gru_model_load_dir: logger.warning(f"GRU model directory specified or derived, but not found at {gru_model_load_dir}. GRU model cannot be loaded.")
+                else: logger.warning("GRU model path could not be determined. GRU model cannot be loaded.")
+
+                if LOAD_SAC_AGENT:
+                    if sac_model_load_dir and os.path.isdir(sac_model_load_dir):
+                        logger.info(f"Found SAC model directory: {sac_model_load_dir}. Loading (LOAD_SAC_AGENT=True)...")
+                        if trading_system.sac_agent is None:
+                             trading_system.sac_agent = SimplifiedSACTradingAgent(state_dim=SAC_STATE_DIM, model_dir=sac_model_load_dir)
+                        if trading_system.sac_agent.load(sac_model_load_dir): 
+                            logger.info("SAC agent loaded successfully.")
+                            sac_loaded = True
+                        else: logger.warning(f"SAC model directory found, but loading failed.")
+                    elif sac_model_load_dir: logger.warning(f"SAC agent model directory derived, but not found at {sac_model_load_dir}. SAC agent cannot be loaded (LOAD_SAC_AGENT=True).")
+                else: logger.info("Skipping SAC agent loading (LOAD_SAC_AGENT=False).")
+
+                if gru_loaded: TRAIN_GRU_MODEL = False
+                if sac_loaded: TRAIN_SAC_AGENT = False; LOAD_SAC_AGENT = True 
+
+            except Exception as e:
+                logger.warning(f"Could not load existing system components: {e}. Proceeding based on training flags.")
+                gru_loaded = False; sac_loaded = False
+                TRAIN_GRU_MODEL = True; TRAIN_SAC_AGENT = True; LOAD_SAC_AGENT = False
+
+    elif LOAD_EXISTING_SYSTEM: pass 
+    else: logger.info("LOAD_EXISTING_SYSTEM=False. Proceeding with training flags.")
+
+    # --- Sanity Check After Loading --- 
+    if not gru_loaded and not TRAIN_GRU_MODEL:
+        logger.error("Critical Error: GRU model was not loaded and TRAIN_GRU_MODEL is False. Cannot proceed.")
+        return 
+    if not sac_loaded and not TRAIN_SAC_AGENT:
+        if RUN_BACKTEST:
+             logger.error("Critical Error: SAC agent was not loaded and TRAIN_SAC_AGENT is False. Aborting because RUN_BACKTEST is True.")
+             return
+        else: logger.warning("Proceeding without a functional SAC agent as RUN_BACKTEST is False.")
+
+    # Train GRU Model (if flag is set and not loaded)
+    if TRAIN_GRU_MODEL:
+        logger.info("--- Training GRU Model --- ")
+        gru_save_dir = MODEL_SAVE_PATH
+        history = trading_system.train_gru(
+            train_data=train_data, val_data=val_data,
+            prediction_horizon=GRU_PREDICTION_HORIZON,
+            epochs=GRU_EPOCHS, batch_size=GRU_BATCH_SIZE,
+            patience=GRU_PATIENCE,
+            model_save_dir=gru_save_dir
+        )
+        if history is None: logger.error("GRU Training failed. Aborting."); return
+        logger.info("--- GRU Model Training Finished --- ")
+    elif not gru_loaded: logger.error("GRU Model must be trained or loaded."); return
+    else: logger.info("Skipping GRU training (already loaded).")
+
+    # Train SAC Agent (if flag is set and not loaded)
+    if TRAIN_SAC_AGENT:
+        logger.info("--- Training SAC Agent --- ")
+        if not trading_system.gru_model or not (trading_system.gru_model.is_trained or trading_system.gru_model.is_loaded):
+             logger.error("Cannot train SAC: GRU model not ready."); return
+        
+        if trading_system.sac_agent is None: logger.error("SAC Agent instance is missing in the trading system before training."); return
+        trading_system.sac_agent.model_dir = os.path.join(MODEL_SAVE_PATH, 'sac_agent')
+        logger.info(f"Ensured SAC agent model save dir is set to: {trading_system.sac_agent.model_dir}")
+
+        sac_history = trading_system.train_sac(
+            val_data=val_data,
+            epochs=SAC_EPOCHS,
+            batch_size=SAC_BATCH_SIZE,
+            transaction_cost=TRANSACTION_COST,
+            prediction_horizon=GRU_PREDICTION_HORIZON
+        )
+        logger.info("Finished training SAC agent.")
+
+        if sac_history is not None:
+            sac_save_dir = os.path.join(MODEL_SAVE_PATH, 'sac_agent')
+            logger.info(f"Saving Simplified SAC agent to {sac_save_dir}")
+            trading_system.sac_agent.save(sac_save_dir)
+            
+            if sac_history: 
+                 sac_plot_save_path = os.path.join(RUN_RESULTS_DIR, f'sac_training_history_{run_id}.png')
+                 logger.info(f"Plotting SAC training history to {sac_plot_save_path}...")
+                 try: plot_sac_training_history(sac_history, save_path=sac_plot_save_path)
+                 except Exception as plot_e: logger.error(f"Failed to plot SAC training history: {plot_e}", exc_info=True)
+            else: logger.warning("SAC training finished, but no history data returned for plotting.")
+                      
+    elif not sac_loaded and LOAD_SAC_AGENT: 
+        # This block handles loading SAC if LOAD_EXISTING_SYSTEM was False but LOAD_SAC_AGENT was True (unlikely case)
+         if trading_system.sac_agent is None: trading_system.sac_agent = SimplifiedSACTradingAgent(state_dim=SAC_STATE_DIM) 
+         sac_load_path = os.path.join(MODEL_SAVE_PATH, 'sac_agent') # Load from current run models
+         if os.path.isdir(sac_load_path):
+             logger.info(f"Attempting to load SAC weights from {sac_load_path} (LOAD_SAC_AGENT=True)...")
+             try: trading_system.sac_agent.load(sac_load_path); logger.info("SAC weights loaded."); sac_loaded = True
+             except Exception as e: logger.warning(f"Could not load SAC weights: {e}")
+         else: logger.warning(f"LOAD_SAC_AGENT=True but no weights found at {sac_load_path}.")
+    elif not sac_loaded: logger.warning("SAC Agent not trained or loaded.")
+    else: logger.info("Skipping SAC training (already loaded).")
+
+    # 5. Backtest on Test Data
+    if RUN_BACKTEST:
+        logger.info("--- Running Extended Backtest --- ")
+        if not trading_system.gru_model or not (trading_system.gru_model.is_trained or trading_system.gru_model.is_loaded):
+             logger.error("Cannot backtest: GRU model not ready."); return
+        if not trading_system.sac_agent: logger.error("Cannot backtest: SAC Agent not initialized."); return
+
+        instrument_label = f"{TICKER}/{EXCHANGE}"
+        backtester = ExtendedBacktester(
+            trading_system, 
+            initial_capital=INITIAL_CAPITAL, 
+            transaction_cost=TRANSACTION_COST,
+            instrument_label=instrument_label
+        )
+        backtest_results = backtester.backtest(test_data, verbose=True)
+
+        # 6. Generate Plots and Report
+        if GENERATE_PLOTS:
+            logger.info(f"Generating overall performance plot: {RESULTS_PLOT_PATH}...")
+            backtester.plot_results(save_path=RESULTS_PLOT_PATH)
+        if GENERATE_REPORT:
+            logger.info(f"Generating performance report: {REPORT_SAVE_PATH}...")
+            backtester.generate_performance_report(report_path=REPORT_SAVE_PATH)
+    else:
+        logger.info("Skipping backtesting.")
+    # --- Re-inserted Steps End ---
+
+    logger.info("--- GRU+SAC Pipeline Finished --- ")
+
+if __name__ == "__main__":
+    main() 
--- a/gru_sac_predictor/main_v7.log
+++ b/gru_sac_predictor/main_v7.log
--- a/gru_sac_predictor/models/run_20250416_142744/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_142744/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_142744/best_model_reg.keras
+++ b/gru_sac_predictor/models/run_20250416_142744/best_model_reg.keras
--- a/gru_sac_predictor/models/run_20250416_142744/critic1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_142744/critic1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_142744/critic2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_142744/critic2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_142744/feature_scaler.joblib
+++ b/gru_sac_predictor/models/run_20250416_142744/feature_scaler.joblib
--- a/gru_sac_predictor/models/run_20250416_142744/gru_training_history.png
+++ b/gru_sac_predictor/models/run_20250416_142744/gru_training_history.png
--- a/gru_sac_predictor/models/run_20250416_142744/log_alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_142744/log_alpha.npy
--- a/gru_sac_predictor/models/run_20250416_142744/y_scaler.joblib
+++ b/gru_sac_predictor/models/run_20250416_142744/y_scaler.joblib
--- a/gru_sac_predictor/models/run_20250416_144757/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_144757/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_144757/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_144757/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_144757/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_144757/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_144757/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_144757/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_144757/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_144757/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_144757/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_144757/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_145128/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_145128/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_145128/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_145128/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_145128/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_145128/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_145128/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_145128/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_145128/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_145128/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_145128/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_145128/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150829/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150829/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150829/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_150829/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_150829/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150829/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150829/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150829/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150829/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150829/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150829/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150829/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150924/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150924/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150924/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_150924/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_150924/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150924/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150924/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150924/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150924/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150924/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_150924/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_150924/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151322/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151322/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151322/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_151322/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_151322/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151322/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151322/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151322/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151322/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151322/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151322/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151322/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151849/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151849/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151849/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_151849/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_151849/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151849/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151849/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151849/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151849/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151849/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_151849/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_151849/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_152415/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_152415/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_152415/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_152415/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_152415/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_152415/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_152415/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_152415/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_152415/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_152415/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_152415/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_152415/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153132/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153132/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153132/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_153132/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_153132/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153132/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153132/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153132/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153132/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153132/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153132/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153132/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153846/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153846/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153846/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_153846/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_153846/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153846/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153846/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153846/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153846/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153846/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_153846/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_153846/sac_agent/target_critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_154636/sac_agent/actor.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_154636/sac_agent/actor.weights.h5
--- a/gru_sac_predictor/models/run_20250416_154636/sac_agent/alpha.npy
+++ b/gru_sac_predictor/models/run_20250416_154636/sac_agent/alpha.npy
--- a/gru_sac_predictor/models/run_20250416_154636/sac_agent/critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_154636/sac_agent/critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_154636/sac_agent/critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_154636/sac_agent/critic_2.weights.h5
--- a/gru_sac_predictor/models/run_20250416_154636/sac_agent/target_critic_1.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_154636/sac_agent/target_critic_1.weights.h5
--- a/gru_sac_predictor/models/run_20250416_154636/sac_agent/target_critic_2.weights.h5
+++ b/gru_sac_predictor/models/run_20250416_154636/sac_agent/target_critic_2.weights.h5
--- a/Show More
+++ b/Show More