AI & ML interests
Enterprise AI and ML, Foundation Models, Responsible AI
Recent Activity
Papers
VAREX: A Benchmark for Multi-Modal Structured Extraction from Documents
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs
Articles
-
ibm-research/ttm-research-r2
Time Series Forecasting • 855k • Updated • 4.84k • 6 -
ibm-research/ttm-r3
Time Series Forecasting • 1.41M • Updated • 34.3k • 4 -
ibm-research/flowstate
Time Series Forecasting • 9.07M • Updated • 328k • 8 -
ibm-research/patchtst-fm-r1
Time Series Forecasting • 0.3B • Updated • 22k • 8
-
AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance
Paper • 2506.03828 • Published • 20 -
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes
Paper • 2506.03278 • Published • 7 -
ibm-research/AssetOpsBench
Viewer • Updated • 467 • 582 • 19 -
AssetOpsBench
📉4Evaluating Autonomous AI Agents for Industry 4.0 Tasks
-
AssetOpsBench
🚀19Generate and benchmark machine learning models with ease
-
CUGA Agent
🤖98Configurable Generalist Agent, leader in AppWorld Benchmark
-
ITBench-Lite-Space
🚀7Develop and run interactive code notebooks with JupyterLab
-
VAKRA Leaderboard
🏆18Benchmark AI agents on multi‑hop, multi‑source enterprise tasks
-
ibm-research/granite-3.2-2b-instruct-GGUF
Text Generation • 3B • Updated • 1.6k • 12 -
ibm-research/granite-3.2-8b-instruct-GGUF
Text Generation • 8B • Updated • 1.29k • 9 -
ibm-research/granite-vision-3.2-2b-GGUF
3B • Updated • 406 • 12 -
ibm-research/granite-guardian-3.2-3b-a800m-GGUF
Text Generation • 3B • Updated • 129 • 3
-
ibm-research/ttm-research-r2
Time Series Forecasting • 855k • Updated • 4.84k • 6 -
ibm-research/ttm-r3
Time Series Forecasting • 1.41M • Updated • 34.3k • 4 -
ibm-research/flowstate
Time Series Forecasting • 9.07M • Updated • 328k • 8 -
ibm-research/patchtst-fm-r1
Time Series Forecasting • 0.3B • Updated • 22k • 8
-
AssetOpsBench
🚀19Generate and benchmark machine learning models with ease
-
CUGA Agent
🤖98Configurable Generalist Agent, leader in AppWorld Benchmark
-
ITBench-Lite-Space
🚀7Develop and run interactive code notebooks with JupyterLab
-
VAKRA Leaderboard
🏆18Benchmark AI agents on multi‑hop, multi‑source enterprise tasks
-
AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance
Paper • 2506.03828 • Published • 20 -
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes
Paper • 2506.03278 • Published • 7 -
ibm-research/AssetOpsBench
Viewer • Updated • 467 • 582 • 19 -
AssetOpsBench
📉4Evaluating Autonomous AI Agents for Industry 4.0 Tasks
-
ibm-research/granite-3.2-2b-instruct-GGUF
Text Generation • 3B • Updated • 1.6k • 12 -
ibm-research/granite-3.2-8b-instruct-GGUF
Text Generation • 8B • Updated • 1.29k • 9 -
ibm-research/granite-vision-3.2-2b-GGUF
3B • Updated • 406 • 12 -
ibm-research/granite-guardian-3.2-3b-a800m-GGUF
Text Generation • 3B • Updated • 129 • 3