AmanPriyanshu

Investigating attacks using Splunk Enterprise logs and creating SPL intrusion detection searches based on known attacker TTPs and anomaly behavior derived from statistical baselines

29 4 Updated Nov 19, 2023

A curated list of awesome privilege escalation

1,494 169 Updated Aug 20, 2025
HTML 182 67 Updated Dec 13, 2025

IoT HackBot: A collection of Claude Skills and custom tooling for hybrid IoT pentesting

Python 466 82 Updated Dec 24, 2025

Use Hugging Face with JavaScript

TypeScript 2,312 577 Updated Dec 29, 2025

my notes

Python 235 63 Updated Nov 27, 2025

Elastic Malware Benchmark for Empowering Researchers

Jupyter Notebook 1,110 304 Updated Nov 22, 2024
Jupyter Notebook 63 17 Updated Nov 21, 2025

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

Python 116 10 Updated Oct 23, 2023

An adaptive sampling framework for Reinforce-style LLM post training.

Python 86 15 Updated Nov 29, 2025

Awesome Mixture of Experts (MoE): A Curated List of Mixture of Experts (MoE) and Mixture of Multimodal Experts (MoME)

51 5 Updated Oct 6, 2025

The Foundation AI Testing Hub (FAITH) is a benchmarking tool used to assess LLMs' competency on (cybersecurity) knowledge/tasks.

Python 12 1 Updated Dec 29, 2025

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Jupyter Notebook 233 21 Updated Sep 2, 2025

This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, motivations, etc.) in a short creative story

Batchfile 327 7 Updated Dec 16, 2025

Lightning fast data version control system for structured and unstructured machine learning datasets. We aim to make versioning datasets as easy as versioning code.

Rust 1,070 21 Updated Jan 1, 2026

My personal blog

HTML 1 Updated Mar 15, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,355 368 Updated Jan 1, 2026

On the Theoretical Limitations of Embedding-Based Retrieval

Jupyter Notebook 614 47 Updated Sep 15, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,276 441 Updated Dec 26, 2025

Generating, validating and running exploitable verifiable coding problems

Python 8 Updated Dec 11, 2025

Fast, Flexible and Portable Structured Generation

C++ 1,447 117 Updated Jan 1, 2026

The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.

Python 33,274 1,054 Updated Dec 20, 2025

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 292 34 Updated Dec 12, 2025

Data about all known supply-chain attacks through history

JavaScript 63 1 Updated May 28, 2025
Python 132 8 Updated Dec 9, 2025

A curated collection of privacy-preserving machine learning techniques, tools, and practical evaluations. Focuses on differential privacy, federated learning, secure computation, and synthetic data…

4 Updated Jun 9, 2025

SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data For High-Stakes Domains (EMNLP 2025 System Demonstration)

Python 25 2 Updated Nov 3, 2025

A curated list of tools, papers, and datasets for applying AI to cybersecurity tasks. This list primarily focuses on modern AI technologies like Large Language Models (LLMs), Agents, and Multi-Moda…

105 8 Updated Dec 14, 2025