Markdown Converter
Agent skill for markdown-converter
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
Sign in to like and favorite skills
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
Zorro is a comprehensive malware detection framework for package ecosystems with two complementary approaches:
icn/)The ICN system operates on two levels:
Key modules:
icn/models/icn_model.py - Main neural network architecture with convergence loopicn/training/ - Complete training pipeline with curriculum learningicn/evaluation/ - Comprehensive benchmarking framework against SOTA modelsicn/data/ - Data preparation and preprocessing pipelineicn/parsing/ - Code parsing, tokenization, and AST analysisicn/evaluation/)Comprehensive benchmarking system comparing ICN against:
Supports multiple evaluation modes:
amil/)Attention-based Multiple Instance Learning system for fast package classification:
Key modules:
amil/model.py - Core AMIL architecture with attention-based MIL poolingamil/trainer.py - 3-stage curriculum learning pipeline (balanced → augmented → realistic)amil/evaluator.py - Comprehensive evaluation (classification, localization, speed, robustness)amil/feature_extractor.py - Combined code embedding and handcrafted feature extractionamil_benchmark_integration.py - Integration with ICN evaluation framework# Full ICN training pipeline with curriculum learning python train_icn.py # Quick structure test python test_icn_structure.py # Verify complete pipeline python verify_icn_pipeline.py
# Quick benchmark test python test_benchmark_framework.py # Full benchmark study (requires OpenRouter API key) python run_icn_benchmark.py --include-llms --include-huggingface --include-baselines # LLM granularity comparison python icn/evaluation/test_granularity.py
# Extract malicious samples from dataset python extract_malicious_samples.py # ICN demo on sample data python icn_demo.py python icn_phase2_demo.py
# AMIL demo with comprehensive examples python amil_demo.py # AMIL model training (3-stage curriculum) python -m amil.trainer --config-file amil_config.json # AMIL comprehensive evaluation python -m amil.evaluator --model-path checkpoints/amil_model.pth --test-data data/test_samples/ # AMIL benchmark integration test python amil_benchmark_integration.py
# Install dependencies using uv uv sync # Add a new dependency uv add <package-name> # Update dependencies uv lock --upgrade
icn/training/config.py - model, curriculum, optimization settingsicn/training/wandb_config.py - W&B experiment trackingrun_icn_benchmark.py - evaluation configurationamil/config.py - AMIL model configuration, curriculum stages, feature settingsRequired environment variables:
GITHUB_TOKEN: GitHub API access (for advisory scraping)OPENROUTER_API_KEY: OpenRouter API for LLM benchmarking (optional)WANDB_API_KEY: W&B experiment tracking (optional)malicious-software-packages-dataset/ - Primary malware datasetdata/ - Processed training data, splits, embeddingscheckpoints/ - Model checkpoints and saved stateslogs/ - Training and evaluation logstest_results/ - Benchmark results and analysisICN uses a dual intent system:
net.outbound, fs.read, fs.write, proc.spawn, eval, crypto, sys.env, etc.Primary metrics for model comparison:
Production-ready targets:
Cross-ecosystem evaluation tests generalization between package managers.
uv for Python dependency management (not pip directly).venv/.. notation - run scripts from project root