Files
velero/design/wildcard-namespace-support-design.md
Joseph eb8b382816 Update design
Signed-off-by: Joseph <jvaikath@redhat.com>
2025-08-19 17:44:45 -07:00

11 KiB

Wildcard Namespace Includes/Excludes Support for Backups and Restores

Abstract

Velero currently does not support wildcard characters in namespace specifications, requiring all namespaces to be specified as string literals. The only exception is the standalone "*" character, which includes all namespaces and ignores excludes.

This document details the approach to implementing wildcard namespace support for --include-namespaces and --exclude-namespaces flags, while preserving the existing "*" behavior for backward compatibility.

Background

This feature was requested in Issue #1874 to provide more flexible namespace selection using wildcard patterns.

Goals

  • Add support for wildcard patterns in --include-namespaces and --exclude-namespaces flags for both backup and restore
  • Ensure legacy "*" behavior remains unchanged for backward compatibility

Non-Goals

  • Completely rethinking the way "*" is treated and allowing it to work with wildcard excludes
  • Supporting complex regex patterns beyond basic glob patterns

High-Level Design

Backup

The wildcard expansion implementation focuses on two key functions in pkg/backup/item_collector.go:

The collectNamespaces function is the ideal integration point because it:

  • Already retrieves all active namespaces from the cluster
  • Processes the user-specified namespace filters
  • Can expand wildcard patterns against the complete namespace list
  • Stores the resolved namespaces in new backup status fields for visibility

This approach ensures wildcard namespaces are handled consistently with the existing "*" behavior, bypassing individual namespace existence checks.

Restore

The wildcard expansion implementation for restore operations focuses on the main execution flow in pkg/restore/restore.go:

  • execute - Main restore execution that parses backup contents and processes namespace filters
  • extractNamespacesFromBackup - Extracts available namespaces from backup tar contents

The execute function is the ideal integration point because it:

  • Already parses the backup tar file to understand available resources
  • Processes the user-specified namespace filters for the restore operation
  • Can expand wildcard patterns against namespaces that actually exist in the backup
  • Stores the resolved namespaces in new restore status fields for visibility

This approach ensures wildcard namespaces in restore operations are based on actual backup contents rather than original backup specifications, providing safety and consistency regardless of how the backup was created.

Detailed Design

The implementation involves four main components that can be developed incrementally:

Add new status fields to the backup and restore CRDs to store expanded wildcard namespaces

// BackupStatus captures the current status of a Velero backup.
type BackupStatus struct {
    // ... existing fields ...
    
    // IncludeWildcardMatches records the expanded include wildcard namespaces
    // +optional
    // +nullable
    IncludeWildcardMatches []string `json:"includeWildcardMatches,omitempty"`

    // ExcludeWildcardMatches records the expanded exclude wildcard namespaces
    // +optional
    // +nullable
    ExcludeWildcardMatches []string `json:"excludeWildcardMatches,omitempty"`

    // WildcardResult records the final namespaces after applying wildcard include/exclude logic
	// +optional
	// +nullable
	WildcardResult []string `json:"wildcardResult,omitempty"`
    
    // ... other fields ...
}

Implementation: Added status fields IncludeWildcardMatches, ExcludeWildcardMatches, and WildcardResult to pkg/apis/velero/v1/backup_types.go and pkg/apis/velero/v1/restore_types.go to track the resolved namespace lists after wildcard expansion.

Create a util package for wildcard expansion

Implementation: Created pkg/util/wildcard/expand.go package containing:

  • ShouldExpandWildcards(includes, excludes []string) bool - Determines if wildcard expansion is needed (excludes simple "*" case)
  • ExpandWildcards(activeNamespaces, includes, excludes []string) ([]string, []string, error) - Main expansion function
  • containsWildcardPattern(pattern string) bool - Detects wildcard patterns (*, ?, [abc], {a,b,c})
  • validateWildcardPatterns(patterns []string) error - Validates patterns and rejects unsupported regex symbols
  • Uses github.com/gobwas/glob library for glob pattern matching

Supported patterns:

  • * (any characters)
  • ? (single character)
  • [abc] (character classes)
  • {a,b,c} (alternatives)

Unsupported: Regex symbols |(), consecutive asterisks **

If required, expand wildcards and replace the request's includes and excludes with expanded namespaces

Backup:

Implementation: In pkg/backup/item_collector.go:

// collectNamespaces function (line 748-803)
if wildcard.ShouldExpandWildcards(namespaceSelector.GetIncludes(), namespaceSelector.GetExcludes()) {
    if err := r.expandNamespaceWildcards(activeNamespacesList, namespaceSelector); err != nil {
        return nil, errors.WithMessage(err, "failed to expand namespace wildcard patterns")
    }
}

The expansion occurs when collecting namespaces, after retrieving all active namespaces from the cluster. The expandNamespaceWildcards method:

  • Calls wildcard.ExpandWildcards() with active namespaces and original patterns
  • Updates the namespace selector with expanded results using SetIncludes() and SetExcludes()
  • Preserves backward compatibility by skipping expansion for simple "*" pattern

Performance Improvement: As part of this implementation, active namespaces are stored in a hashset rather than being iterated for each resolved/literal namespace check. This eliminates a nested loop anti-pattern and improves performance.

Restore

Implementation: In pkg/restore/restore.go:

// Lines 478-509: Wildcard expansion in restore context
if wildcard.ShouldExpandWildcards(ctx.restore.Spec.IncludedNamespaces, ctx.restore.Spec.ExcludedNamespaces) {
    availableNamespaces := extractNamespacesFromBackup(backupResources)
    expandedIncludes, expandedExcludes, err := wildcard.ExpandWildcards(
        availableNamespaces,
        ctx.restore.Spec.IncludedNamespaces, 
        ctx.restore.Spec.ExcludedNamespaces,
    )
    // Update restore context with expanded patterns
    ctx.namespaceIncludesExcludes = collections.NewIncludesExcludes().
        Includes(expandedIncludes...).
        Excludes(expandedExcludes...)
}

The restore expansion occurs after parsing the backup tar contents, using extractNamespacesFromBackup to determine which namespaces are actually available for restoration. This ensures wildcard patterns are applied against materialized backup contents rather than original backup specifications.

Populate the expanded namespace status field with the namespaces

Backup Status Fields

Implementation: In expandNamespaceWildcards function (line 889-891):

// Record the expanded wildcard includes/excludes in the request status
r.backupRequest.Status.IncludeWildcardMatches = expandedIncludes
r.backupRequest.Status.ExcludeWildcardMatches = expandedExcludes
r.backupRequest.Status.WildcardResult = wildcardResult

Restore Status Fields

Implementation: In pkg/restore/restore.go (lines 499-502):

// Record the expanded wildcard includes/excludes in the restore status
ctx.restore.Status.IncludeWildcardMatches = expandedIncludes
ctx.restore.Status.ExcludeWildcardMatches = expandedExcludes
ctx.restore.Status.WildcardResult = wildcardResult

The status fields are populated immediately after successful wildcard expansion, providing visibility into which namespaces were actually matched by the wildcard patterns and the final list of namespaces that will be processed.

Alternatives Considered

Several implementation approaches were considered for the wildcard expansion logic:

1. Wildcard expansion in getNamespacesToList

This approach was ruled out because:

  • The collectNamespaces function encounters wildcard patterns first in the processing flow
  • collectNamespaces already has access to the complete list of active namespaces, eliminating the need for additional API calls
  • Placing the logic in getNamespacesToList would require redundant namespace retrieval

2. Client-side wildcard expansion

Expanding wildcards on the CLI side was considered but rejected because:

  • It would only work for command-line initiated backups
  • Scheduled backups would not benefit from this approach
  • The server-side approach provides consistent behavior across all backup initiation methods

Security Considerations

This feature does not introduce any security vulnerabilities as it only affects namespace selection logic within the existing backup authorization framework.

Compatibility

Backward Compatibility

The implementation maintains full backward compatibility with existing behavior:

  • The standalone "*" character continues to work as before (include all namespaces, ignore excludes)
  • Existing backup configurations remain unaffected

Known Limitations

  1. Mixed "*" and wildcard usage: When using the standalone "" in includes, the legacy implementation takes precedence and wildcard expansion is skipped. This means you cannot combine "" (all namespaces) in includes with wildcard patterns in excludes.

  2. Selective exclusion limitation: The current design does not support selective pattern-based exclusion when including all namespaces via "*".

Implementation

The implementation follows the detailed design outlined above, with the following timeline:

  1. Phase 1: Add backup CRD status fields for expanded namespaces
  2. Phase 2: Implement wildcard utility package with validation
  3. Phase 3: Integrate expansion logic into backup item collection
  4. Phase 4: Add status field population and logging

Open Issues

Restore Integration: Restore operations are completely decoupled from backup wildcard specifications and work safely with wildcard-created backups. The restore process reads the actual tar file contents to determine available namespaces, not the original backup spec. Since the tar file contains real, materialized namespaces (e.g., test01, test02) rather than wildcard patterns (e.g., test*), restore operations work with concrete namespace names.

When wildcard patterns are specified in restore operations, they are expanded against the namespaces that actually exist in the backup tar file. This ensures that restore wildcard expansion is based on what was actually backed up, not what was originally intended to be backed up.

Crucially, restore does not treat * as a catch all case. This means that if * is mentioned in restore, wildcard expansion is skipped owing to how it's treated in backup.