I am trying to use the bitsandbytes library for 4-bit quantization in my model loading function, but I keep encountering an ImportError. The error message says, "Using bitsandbytes 4-bit quantization requires the latest version of bitsandbytes," even though I have already installed version 0.45.3.
I have confirmed that bitsandbytes is installed by running pip show bitsandbytes, and it shows the correct version. I have also tried upgrading it using pip install -U bitsandbytes, but the error persists. Additionally, I have imported bitsandbytes at the beginning of my script (import bitsandbytes as bnb), yet the issue continues.
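For completeness, this is the kind of minimal check I can run in the same kernel to confirm what that environment actually sees (it assumes torch and transformers import cleanly there, which they do in my script; it is only a diagnostic, not part of the loading code):

# Diagnostic only: print the versions and CUDA visibility seen by the
# Python environment that raises the ImportError.
import torch
import transformers
import bitsandbytes as bnb

print("torch:", torch.__version__)
print("transformers:", transformers.__version__)
print("bitsandbytes:", bnb.__version__)
print("CUDA available:", torch.cuda.is_available())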
Is there any other configuration or setup I need to follow to resolve this issue? Any suggestions on how to get this working would be greatly appreciated!
Here is the code where I am using it:
# Imports used elsewhere in the script:
# import torch
# import bitsandbytes as bnb
# from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig

def get_model(model = CFG.model_name):
    print('\nDownloading model: ', model, '\n\n')

    if model == 'wizardlm':
        model_repo = 'TheBloke/wizardLM-7B-HF'
        tokenizer = AutoTokenizer.from_pretrained(model_repo)
        bnb_config = bnb.BitsAndBytesConfig(
            load_in_4bit=True,
            bnb_4bit_quant_type="nf4",
            bnb_4bit_compute_dtype=torch.float16,
            bnb_4bit_use_double_quant=True,
        )
        model = AutoModelForCausalLM.from_pretrained(
            model_repo,
            quantization_config = bnb_config,
            device_map = 'auto',
            low_cpu_mem_usage = True
        )
        max_len = 1024

    elif model == 'llama2-7b-chat':
        model_repo = 'daryl149/llama-2-7b-chat-hf'
        tokenizer = AutoTokenizer.from_pretrained(model_repo, use_fast=True)
        bnb_config = BitsAndBytesConfig(
            load_in_4bit = True,
            bnb_4bit_quant_type = "nf4",
            bnb_4bit_compute_dtype = torch.float16,
            bnb_4bit_use_double_quant = True,
        )
        model = AutoModelForCausalLM.from_pretrained(
            model_repo,
            quantization_config = bnb_config,
            device_map = 'auto',
            low_cpu_mem_usage = True,
            trust_remote_code = True
        )
        max_len = 2048

    elif model == 'llama2-13b-chat':
        model_repo = 'daryl149/llama-2-13b-chat-hf'
        tokenizer = AutoTokenizer.from_pretrained(model_repo, use_fast=True)
        bnb_config = BitsAndBytesConfig(
            load_in_4bit = True,
            bnb_4bit_quant_type = "nf4",
            bnb_4bit_compute_dtype = torch.float16,
            bnb_4bit_use_double_quant = True,
        )
        model = AutoModelForCausalLM.from_pretrained(
            model_repo,
            quantization_config = bnb_config,
            device_map = 'auto',
            low_cpu_mem_usage = True,
            trust_remote_code = True
        )
        max_len = 2048  # 8192

    elif model == 'mistral-7B':
        model_repo = 'mistralai/Mistral-7B-v0.1'
        tokenizer = AutoTokenizer.from_pretrained(model_repo)
        bnb_config = BitsAndBytesConfig(
            load_in_4bit = True,
            bnb_4bit_quant_type = "nf4",
            bnb_4bit_compute_dtype = torch.float16,
            bnb_4bit_use_double_quant = True,
        )
        model = AutoModelForCausalLM.from_pretrained(
            model_repo,
            quantization_config = bnb_config,
            device_map = 'auto',
            low_cpu_mem_usage = True,
        )
        max_len = 1024

    else:
        print("Not implemented model (tokenizer and backbone)")

    return tokenizer, model, max_len
And here is the call where the error is raised:
tokenizer, model, max_len = get_model(model = CFG.model_name)
I tried installing and upgrading the bitsandbytes library to the latest version (0.45.3) using the following command:
pip install -U bitsandbytes
After this, I checked that the library was installed correctly by running:
pip show bitsandbytes
I expected that, after upgrading, the 4-bit quantization error would be resolved and the model loading function would work without raising an ImportError. However, despite upgrading and confirming the installation, I still encounter the same error:
ImportError: Using bitsandbytes 4-bit quantization requires the latest version of bitsandbytes