Skip to content
This repository was archived by the owner on Oct 25, 2024. It is now read-only.

Pull requests: intel/intel-extension-for-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[NeuralChat] Add Multi-Socket LLM Inference Example draft
#1073 opened Dec 25, 2023 by letonghan Loading… updated Jan 26, 2024
[NeuralChat] Support Assisted Generation on Multi-nodes draft
#1283 opened Feb 19, 2024 by letonghan Draft updated Mar 27, 2024
Removed fallback for lm_head op WIP
#1482 opened Apr 15, 2024 by PenghuiCheng Loading… updated Apr 16, 2024
[NeuralChat] Enable RAG's table extraction and summary NeuralChat
#1417 opened Mar 25, 2024 by xmx-521 Loading… updated May 8, 2024
Feature/support older intel mac book pro with gcc 13
#1085 opened Dec 27, 2023 by nezda Draft updated May 13, 2024
[NeuralChat] Support user management in backend server NeuralChat
#952 opened Dec 17, 2023 by lvliang-intel Loading… updated May 13, 2024
add FP8Config habana
#1442 opened Apr 1, 2024 by mengniwang95 Loading… updated May 13, 2024
[NeuralChat] Add new customized chabot UI NeuralChat
#1455 opened Apr 3, 2024 by lvliang-intel Loading… updated May 13, 2024
lm-eval for llama.cpp enhancement.
#1543 opened May 12, 2024 by lkk12014402 Loading… updated May 26, 2024
[NeuralChat] RAG evaluation NeuralChat
#1333 opened Mar 1, 2024 by Liangyx2 Loading… updated Jun 3, 2024
Gaudi Tensor split for memory optimization
#1575 opened May 29, 2024 by ClarkChin08 Loading… updated Jun 7, 2024
Enable vllm backend.
#1602 opened Jun 11, 2024 by ZePan110 Loading… updated Jun 17, 2024
Support INC layerwise quant
#1623 opened Jun 19, 2024 by changwangss Loading… updated Jun 19, 2024
Bump neural docker base
#1633 opened Jun 21, 2024 by Chris-Sigopt Loading… updated Jun 27, 2024
update ipex api
#1650 opened Jul 3, 2024 by LJ-underdog Draft updated Jul 5, 2024
Enable modelscope for itrex
#1655 opened Jul 5, 2024 by LJ-underdog Loading… updated Jul 9, 2024
A beginner friendly quantize and text embeddings tutorial for XPUs
#1663 opened Jul 14, 2024 by sleepingcat4 Loading… updated Jul 15, 2024
Support lmhead int4
#1670 opened Jul 17, 2024 by a32543254 Draft updated Jul 17, 2024
qbits deprecate clip postfix
#1672 opened Jul 17, 2024 by zhewang1-intc Draft updated Jul 18, 2024
Bump langchain-community from 0.0.27 to 0.2.5 in /intel_extension_for_transformers/neural_chat/tests 1.5 dependencies Pull requests that update a dependency file python Pull requests that update Python code
#1613 opened Jun 14, 2024 by dependabot bot Loading… updated Jul 22, 2024
Bump langchain-community from 0.0.27 to 0.2.9 in /intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval dependencies Pull requests that update a dependency file python Pull requests that update Python code
#1678 opened Jul 24, 2024 by dependabot bot Loading… updated Jul 24, 2024
Bump torch from 1.13.1 to 2.2.0 in /workflows/compression_aware_training dependencies Pull requests that update a dependency file python Pull requests that update Python code
#1679 opened Jul 25, 2024 by dependabot bot Loading… updated Jul 25, 2024
Update utils.py
#1691 opened Aug 23, 2024 by changwangss Loading… updated Aug 23, 2024
Add readme for inc 3.0 xpu device usage
#1693 opened Aug 26, 2024 by changwangss Loading… updated Aug 26, 2024
Bump langchain from 0.1.11 to 0.2.10 in /intel_extension_for_transformers/neural_chat/tests dependencies Pull requests that update a dependency file python Pull requests that update Python code
#1698 opened Sep 17, 2024 by dependabot bot Loading… updated Sep 17, 2024
ProTip! Type g i on any issue or pull request to go back to the issue listing page.